This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Synthetic Data Generation by Artificial Intelligence to Accelerate Research and Precision Medicine in Hematology
Citations: 85
Authors: 29
Year: 2023
Abstract
PURPOSE: Synthetic data are artificial data generated, without including any real patient information, by an algorithm trained to learn the characteristics of a real source data set; they have become widely used to accelerate research in the life sciences. We aimed to (1) apply generative artificial intelligence to build synthetic data in different hematologic neoplasms; (2) develop a synthetic validation framework to assess data fidelity and privacy preservability; and (3) test the capability of synthetic data to accelerate clinical/translational research in hematology.
METHODS: A conditional generative adversarial network architecture was implemented to generate synthetic data. The use cases were myelodysplastic syndromes (MDS) and acute myeloid leukemia (AML); 7,133 patients were included. A fully explainable validation framework was created to assess the fidelity and privacy preservability of the synthetic data.
RESULTS: We generated MDS/AML synthetic cohorts (including information on clinical features, genomics, treatment, and outcomes) with high fidelity and privacy performance. This technology allowed missing or incomplete information to be resolved and data to be augmented. We then assessed the potential value of synthetic data for accelerating research in hematology. Starting from 944 patients with MDS available since 2014, we generated a 300% augmented synthetic cohort and anticipated the development of the molecular classification and molecular scoring system that were obtained many years later from 2,043 and 2,957 real patients, respectively. Moreover, starting from 187 patients with MDS treated with luspatercept in a clinical trial, we generated a synthetic cohort that recapitulated all the clinical end points of the study. Finally, we developed a website to enable clinicians to generate high-quality synthetic data from an existing biobank of real patients.
CONCLUSION: Synthetic data mimic real clinical-genomic features and outcomes and anonymize patient information. Implementing this technology increases the scientific use and value of real data, thus accelerating precision medicine in hematology and the conduct of clinical trials.
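The abstract names a validation framework for fidelity and privacy but does not say which metrics it uses. As a rough illustration only, the sketch below shows two checks that are commonly applied to synthetic tabular data: a per-feature two-sample Kolmogorov-Smirnov statistic for fidelity and a distance-to-closest-record check for privacy. These metrics, the function names `ks_statistic` and `distance_to_closest_record`, and the toy data are all assumptions made for illustration, not the authors' method; the rows are random numbers, not patient data or GAN output.

```python
import math
import random
from bisect import bisect_right


def ks_statistic(real, synth):
    """Two-sample Kolmogorov-Smirnov statistic for one numeric feature.

    Returns the largest gap between the two empirical CDFs:
    0.0 means identical samples, 1.0 means no overlap at all.
    """
    real, synth = sorted(real), sorted(synth)
    points = sorted(set(real) | set(synth))
    return max(
        abs(bisect_right(real, x) / len(real) - bisect_right(synth, x) / len(synth))
        for x in points
    )


def distance_to_closest_record(synth_rows, real_rows):
    """Euclidean distance from each synthetic row to its nearest real row."""
    return [min(math.dist(s, r) for r in real_rows) for s in synth_rows]


# Toy stand-in data: random numbers, NOT patient data and NOT GAN output.
rng = random.Random(0)
real_rows = [(rng.gauss(70, 10), rng.gauss(9, 2)) for _ in range(200)]
synth_rows = [(rng.gauss(70, 10), rng.gauss(9, 2)) for _ in range(200)]

# Fidelity: compare each feature's marginal distribution.
for j, name in enumerate(["feature_a", "feature_b"]):
    ks = ks_statistic([r[j] for r in real_rows], [s[j] for s in synth_rows])
    print(f"{name}: KS statistic = {ks:.3f}")

# Privacy: count synthetic rows that exactly copy a real row.
dcr = distance_to_closest_record(synth_rows, real_rows)
print("exact copies of real rows:", sum(d == 0.0 for d in dcr))
print(f"smallest distance to a real row: {min(dcr):.3f}")
```

A low KS statistic on every feature suggests the synthetic marginals track the real ones, while distances well above zero suggest that no real record was simply memorized. Both are coarse screens; a real evaluation would also need to cover categorical features, joint distributions, and outcome data.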
Related works
The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia
2016 · 10,132 citations
Human acute myeloid leukemia is organized as a hierarchy that originates from a primitive hematopoietic cell
1997 · 6,915 citations
Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel
2016 · 5,825 citations
Proposals for the Classification of the Acute Leukaemias French‐American‐British (FAB) Co‐operative Group
1976 · 5,592 citations
Genomic and Epigenomic Landscapes of Adult De Novo Acute Myeloid Leukemia
2013 · 5,110 citations
Authors
- Saverio D’Amico
- Daniele Dall’Olio
- Claudia Sala
- Lorenzo Dall’Olio
- Elisabetta Sauta
- Matteo Zampini
- Gianluca Asti
- Luca Lanino
- Giulia Maggioni
- Alessia Campagna
- Marta Ubezio
- Antonio Russo
- Maria Elena Bicchieri
- Elena Riva
- Cristina Astrid Tentori
- Erica Travaglino
- Pierandrea Morandini
- Victor Savevski
- Armando Santoro
- Iñigo Prada-Luengo
- Anders Krogh
- Valeria Santini
- Shahram Kordasti
- Uwe Platzbecker
- María Díez‐Campelo
- Pierre Fenaux
- Torsten Haferlach
- Gastone Castellani
- Matteo Giovanni Della Porta
Institutions
- IRCCS Humanitas Research Hospital(IT)
- Humanitas University(IT)
- University of Copenhagen(DK)
- Azienda Ospedaliero-Universitaria Careggi(IT)
- University of Florence(IT)
- Marche Polytechnic University(IT)
- Guy's Hospital(GB)
- King's College London(GB)
- University Hospital Leipzig(DE)
- Complejo Hospitalario de Salamanca(ES)
- Université Paris Cité(FR)
- Hôpital Saint-Louis(FR)
- Munich Leukemia Laboratory (Germany)(DE)