This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Specialized Large Language Model Outperforms Neurologists at Complex Diagnosis in Blinded Case-Based Evaluation
Citations: 13
Authors: 25
Year: 2025
Abstract
<b>Background/Objectives</b>: Artificial intelligence (AI), particularly large language models (LLMs), has demonstrated versatility in various applications but faces challenges in specialized domains like neurology. This study evaluates a specialized LLM's capability and trustworthiness in complex neurological diagnosis, comparing its performance to neurologists in simulated clinical settings. <b>Methods</b>: We deployed GPT-4 Turbo (OpenAI, San Francisco, CA, US) through Neura (Sciense, New York, NY, US), an AI infrastructure with a dual-database architecture integrating "long-term memory" and "short-term memory" components on a curated neurological corpus. Five representative clinical scenarios were presented to 13 neurologists and the AI system. Participants formulated differential diagnoses based on initial presentations, followed by definitive diagnoses after receiving conclusive clinical information. Two senior academic neurologists blindly evaluated all responses, while an independent investigator assessed the verifiability of AI-generated information. <b>Results</b>: AI achieved a significantly higher normalized score (86.17%) compared to neurologists (55.11%, <i>p</i> < 0.001). For differential diagnosis questions, AI scored 85% versus 46.15% for neurologists, and for final diagnosis, 88.24% versus 70.93%. AI obtained 15 maximum scores in its 20 evaluations and responded in under 30 s compared to neurologists' average of 9 min. All AI-provided references were classified as relevant with no hallucinatory content detected. <b>Conclusions</b>: A specialized LLM demonstrated superior diagnostic performance compared to practicing neurologists across complex clinical challenges. This indicates that appropriately harnessed LLMs with curated knowledge bases can achieve domain-specific relevance in complex clinical disciplines, suggesting potential for AI as a time-efficient asset in clinical practice.
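The headline comparisons in the abstract can be restated as simple arithmetic. The following is a minimal sketch that uses only the figures quoted above; the variable names are illustrative and not taken from the paper, and the speed ratio is a lower bound because the abstract only states that the AI responded in "under 30 s".

```python
# Percentages quoted in the abstract: (AI, neurologists)
scores = {
    "overall": (86.17, 55.11),
    "differential_diagnosis": (85.0, 46.15),
    "final_diagnosis": (88.24, 70.93),
}

# Gap between AI and neurologists, in percentage points
gaps = {name: round(ai - neuro, 2) for name, (ai, neuro) in scores.items()}

# Share of AI evaluations that received the maximum score (15 of 20)
max_score_share = 15 / 20

# Lower bound on the response-time ratio: 9 min average vs. under 30 s
speed_ratio_lower_bound = (9 * 60) / 30

for name, gap in gaps.items():
    print(f"{name}: +{gap} percentage points")
print(f"maximum scores: {max_score_share:.0%}")
print(f"response time: at least {speed_ratio_lower_bound:.0f}x faster")
```

The gap is largest for the differential diagnosis questions and smallest for the final diagnosis, where neurologists had already received conclusive clinical information.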
Similar works
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8,200 citations
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8,051 citations
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7,416 citations
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,776 citations
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5,410 citations
Authors
- Sami Barrit
- Nathan Torcida
- Aurélien Mazeraud
- Sébastien Boulogne
- Jeanne Benoit
- Timothée Carette
- Thibault Carron
- Bertil Delsaut
- Eva Diab
- Hugo Kermorvant
- Adil Maarouf
- Sofia Maldonado Slootjes
- Sylvain Redon
- Alexis Robin
- Sofiène Hadidane
- Vincent Harlay
- Vito Tota
- Tanguy Madec
- Alexandre Niset
- Mejdeddine Al Barajraji
- Joseph R. Madsen
- Salim El Hadwe
- Nicolas Massager
- Stanislas Lagarde
- Romain Carron
Institutions
- Boston Children's Hospital (US)
- Université Libre de Bruxelles (BE)
- Harvard University (US)
- Centre Hospitalier Universitaire de Tivoli (BE)
- Université Paris Cité (FR)
- Institut de Psychiatrie et Neurosciences de Paris (FR)
- FHU Neurovasc (FR)
- Université Claude Bernard Lyon 1 (FR)
- Université Côte d'Azur (FR)
- Centre Hospitalier Universitaire de Nice (FR)
- Clinique Saint Pierre (BE)
- UCLouvain (BE)
- Centre National de la Recherche Scientifique (FR)
- Sorbonne Université (FR)
- Centre Hospitalier Universitaire Amiens-Picardie (FR)
- Chirurgie et extrémité céphalique, caractérisation morphologique et fonctionnelle (FR)
- Aix-Marseille Université (FR)
- Centre de Résonance Magnétique Biologique et Médicale (FR)
- Hôpital de la Timone (FR)
- Vrije Universiteit Brussel (BE)
- Universitair Ziekenhuis Brussel (BE)
- Centre Hospitalier Universitaire de Grenoble (FR)
- CHU Ambroise Paré (BE)
- University of New Caledonia (NC)
- Centre Hospitalier Territorial de Nouvelle-Calédonie (NC)
- Cliniques Universitaires Saint-Luc (BE)
- University Hospital of Lausanne (CH)
- University of Cambridge (GB)
- Inserm (FR)
- Institut de Neurosciences des Systèmes (FR)