Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large-Scale Discovery of Disease-Disease and Disease-Gene Associations
37
Zitationen
7
Autoren
2016
Jahr
Abstract
Data-driven phenotype analyses on Electronic Health Record (EHR) data have recently drawn benefits across many areas of clinical practice, uncovering new links in the medical sciences that can potentially affect the well-being of millions of patients. In this paper, EHR data is used to discover novel relationships between diseases by studying their comorbidities (co-occurrences in patients). A novel embedding model is designed to extract knowledge from disease comorbidities by learning from a large-scale EHR database comprising more than 35 million inpatient cases spanning nearly a decade, revealing significant improvements on disease phenotyping over current computational approaches. In addition, the use of the proposed methodology is extended to discover novel disease-gene associations by including valuable domain knowledge from genome-wide association studies. To evaluate our approach, its effectiveness is compared against a held-out set where, again, it revealed very compelling results. For selected diseases, we further identify candidate gene lists for which disease-gene associations were not studied previously. Thus, our approach provides biomedical researchers with new tools to filter genes of interest, thus, reducing costly lab studies.
Ähnliche Arbeiten
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005 · 56.121 Zit.
Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks
2003 · 53.804 Zit.
Gene Ontology: tool for the unification of biology
2000 · 44.367 Zit.
The Protein Data Bank
2000 · 39.545 Zit.
KEGG: Kyoto Encyclopedia of Genes and Genomes
2000 · 38.815 Zit.