Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
AI-MARRVEL — A Knowledge-Driven AI System for Diagnosing Mendelian Disorders
41
Zitationen
23
Autoren
2024
Jahr
Abstract
BACKGROUND: Diagnosing genetic disorders requires extensive manual curation and interpretation of candidate variants, a labor-intensive task even for trained geneticists. Although artificial intelligence (AI) shows promise in aiding these diagnoses, existing AI tools have only achieved moderate success for primary diagnosis. METHODS: AI-MARRVEL (AIM) uses a random-forest machine-learning classifier trained on over 3.5 million variants from thousands of diagnosed cases. AIM additionally incorporates expert-engineered features into training to recapitulate the intricate decision-making processes in molecular diagnosis. The online version of AIM is available at https://ai.marrvel.org. To evaluate AIM, we benchmarked it with diagnosed patients from three independent cohorts. RESULTS: AIM improved the rate of accurate genetic diagnosis, doubling the number of solved cases as compared with benchmarked methods, across three distinct real-world cohorts. To better identify diagnosable cases from the unsolved pools accumulated over time, we designed a confidence metric on which AIM achieved a precision rate of 98% and identified 57% of diagnosable cases out of a collection of 871 cases. Furthermore, AIM's performance improved after being fine-tuned for targeted settings including recessive disorders and trio analysis. Finally, AIM demonstrated potential for novel disease gene discovery by correctly predicting two newly reported disease genes from the Undiagnosed Diseases Network. CONCLUSIONS: AIM achieved superior accuracy compared with existing methods for genetic diagnosis. We anticipate that this tool may aid in primary diagnosis, reanalysis of unsolved cases, and the discovery of novel disease genes. (Funded by the NIH Common Fund and others.).
Ähnliche Arbeiten
Trimmomatic: a flexible trimmer for Illumina sequence data
2014 · 68.673 Zit.
Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology
2015 · 31.585 Zit.
BEDTools: a flexible suite of utilities for comparing genomic features
2010 · 30.077 Zit.
HTSeq—a Python framework to work with high-throughput sequencing data
2014 · 22.508 Zit.
A global reference for human genetic variation
2015 · 19.736 Zit.
Autoren
- Dongxue Mao
- Chaozhong Liu
- Linhua Wang
- Rami AI-Ouran
- Cole A. Deisseroth
- Sasidhar Pasupuleti
- S. Kim
- Lucian Li
- Jill A. Rosenfeld
- Linyan Meng
- Lindsay C. Burrage
- Michael F. Wangler
- Shinya Yamamoto
- Martha Guadalupe Zárate Santana
- Victor L. Perez
- Priyank Shukla
- Christine M. Eng
- Brendan Lee
- Bo Yuan
- Fan Xia
- Hugo J. Bellen
- Pengfei Liu
- Zhandong Liu