This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Validation of Generative AI Techniques for Synthetic Data Generation in Multiple Sclerosis Research: A Comparison with Real-World Evidence from the Italian MS Registry
0
Citations
28
Authors
2025
Year
Abstract
Importance: Large multiple sclerosis (MS) registries provide crucial real-world evidence but often suffer from missing data, inconsistencies, and privacy limitations that restrict data sharing. Using generative AI to create synthetic data (SD) is an emerging strategy to enhance real-world evidence research, potentially overcoming these challenges.
Objective: To evaluate the validity of AI-generated SD in replicating real data collected in the Italian MS and Related Disorders Register (RISM), and to compare the risk of progression independent of relapse activity (PIRA) between early intensive treatment (EIT) and escalation treatment (ESC) strategies in both real and synthetic MS cohorts.
Design, Setting, and Participants: This validation study analyzed data from RISM. AI-based generative models were trained on a sub-cohort of 1,666 patients with tabularized MRI data to generate a synthetic dataset of 4,878 patients. SD was evaluated using the Synthetic vAlidation FramEwork powered by Train (SAFE), assessing fidelity, utility, and privacy. Clinical Synthetic Fidelity (CSF) and Nearest Neighbor Distance Ratio (NNDR) were used for statistical and privacy validation. For clinical validation, treatment outcomes under the EIT and ESC strategies were compared in both real and synthetic datasets, focusing on the risk of PIRA.
Exposures: Initial disease-modifying therapy strategy, categorized as EIT versus ESC.
Main Outcomes and Measures: The primary outcome was the occurrence of PIRA, defined as confirmed disability accrual independent of relapses. Validation metrics included Clinical Synthetic Fidelity (CSF ≥90 optimal) and Nearest Neighbor Distance Ratio (NNDR, range 0.60–0.85 for privacy).
Results: The synthetic dataset demonstrated high fidelity (CSF = 97%) and privacy preservation (NNDR = 0.61). Treatment effect estimates for ESC vs EIT were consistent across real and synthetic datasets, showing largely comparable trends and greater statistical significance in SD. Cox proportional hazards models confirmed the robustness of synthetic data in estimating the risk of the first PIRA event.
Conclusions and Relevance: AI-generated synthetic data reliably replicated treatment effect outcomes from real-world RISM data, overcoming missing data and providing a privacy-preserving alternative for data sharing and clinical research.
Key points
Question: Can Artificial Intelligence (AI)-generated synthetic data (SD) reliably replicate multiple sclerosis (MS) registry data and provide robust insights into progression independent of relapse activity (PIRA) under different treatment strategies?
Findings: In a cohort of 4,878 relapsing-onset MS patients from the Italian MS Register, AI-generated SD achieved high fidelity (CSF = 97%) and reproduced treatment effect outcomes. Both real and synthetic cohorts consistently showed that early intensive therapy reduced the risk of PIRA compared with an escalation strategy.
Meaning: SD can complement and enhance registry-based research by addressing missing data and supporting reproducible analyses in MS.
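The abstract reports NNDR as its privacy metric without spelling out the computation. As a minimal sketch, assuming a commonly used definition (for each synthetic record, the distance to its nearest real record divided by the distance to its second-nearest real record, averaged over all synthetic records), it could be computed as below. The function name `nndr`, the Euclidean metric, and the handling of exact duplicates are illustrative assumptions and not the study's SAFE implementation.

```python
import numpy as np


def nndr(synthetic, real):
    """Mean nearest-neighbour distance ratio of synthetic records against real ones.

    Hypothetical helper: assumes numeric, already-scaled feature matrices
    with one record per row and the same columns in both inputs.
    """
    synthetic = np.asarray(synthetic, dtype=float)
    real = np.asarray(real, dtype=float)
    if real.shape[0] < 2:
        raise ValueError("need at least two real records")

    # Pairwise Euclidean distances, shape (n_synthetic, n_real).
    diffs = synthetic[:, None, :] - real[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))

    # Smallest and second-smallest distance for each synthetic record.
    two_smallest = np.partition(dists, 1, axis=1)[:, :2]
    nearest, second = two_smallest[:, 0], two_smallest[:, 1]

    # Assumption: if the second-nearest distance is zero (the synthetic record
    # coincides with two real records), report a ratio of 0 for that record.
    ratios = np.divide(nearest, second,
                       out=np.zeros_like(nearest), where=second > 0)
    return float(ratios.mean())
```

Under this assumed definition, a ratio near 0 means a synthetic record sits much closer to one real record than to any other, while a ratio near 1 means its two closest real records are about equally distant. How that maps onto the 0.60–0.85 range quoted in the abstract depends on the study's own definition.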
Related works
Rating neurologic impairment in multiple sclerosis
1983 · 14,777 citations
Diagnostic criteria for multiple sclerosis: 2010 Revisions to the McDonald criteria
2011 · 9,757 citations
Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria
2017 · 7,643 citations
New diagnostic criteria for multiple sclerosis: Guidelines for research protocols
1983 · 7,389 citations
Recommended diagnostic criteria for multiple sclerosis: Guidelines from the international panel on the diagnosis of multiple sclerosis
2001 · 6,923 citations
Authors
- Pietro Iaffaldano
- Saverio D’Amico
- Giuseppe Lucisano
- Massimiliano Copetti
- Tommaso Guerra
- Maria A. Rocca
- Francesco Patti
- Giovanna De Luca
- Diana Ferraro
- Rocco Totaro
- Vincenzo Brescia Morra
- Giuseppe Salemi
- Emilio Portaccio
- Matteo Foschi
- Matilde Inglese
- Maria Gabriella Coniglio
- Clara Grazia Chisari
- Francesca Caputo
- Damiano Paolicelli
- Mario Alberto Battaglia
- Matteo Della Porta
- Victor Savevski
- Mattia Delleani
- Filomena Colella
- Elisabetta Sauta
- Maria Pia Amato
- Massimo Filippi
- María Trojano
Institutions
- University of Bari Aldo Moro (IT)
- IRCCS Humanitas Research Hospital (IT)
- Center for Outcomes Research and Clinical Epidemiology (IT)
- Casa Sollievo della Sofferenza (IT)
- Istituti di Ricovero e Cura a Carattere Scientifico (IT)
- Vita-Salute San Raffaele University (IT)
- Istituto di Ricovero e Cura a Carattere Scientifico San Raffaele
- University of Catania (IT)
- University of Chieti-Pescara (IT)
- Azienda Unità Sanitaria Locale di Modena (IT)
- Azienda Ospedaliero-Universitaria di Modena (IT)
- San Salvatore Hospital (IT)
- University of Naples Federico II (IT)
- University of Palermo (IT)
- University of Florence (IT)
- University of L'Aquila (IT)
- University of Genoa (IT)
- Ospedale Madonna delle Grazie (IT)
- Multiple Sclerosis Foundation (US)
- LivaNova (Italy) (IT)
- IRCCS Ospedale San Raffaele (IT)