Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
The Effect of LLM Assistance on Diagnostic Accuracy: A Meta-Analysis
1
Zitationen
5
Autoren
2025
Jahr
Abstract
Abstract Large language models (LLMs) are increasingly used in clinical settings, yet their effect on diagnostic accuracy of physicians has not been systematically quantified. We conducted a systematic review and meta-analysis of studies analyzing LLM-assisted diagnosis published between January 2020 and June 2025. Across 15 studies (43 effect sizes; 498 physicians; 7,274 case evaluations), LLM assistance significantly improved diagnostic accuracy compared to physicians without LLM support (Hedges’ g = 0.20, 95% CI 0.12–0.29; P < .001). Although improvements were observed across multiple LLMs (e.g., GPT-4, AMIE, MedFound-DX-PA), medical fields (general medicine, radiology), and career stages of physicians (residents and attendings), the magnitude of the benefit varied substantially. These findings show that LLMs can improve diagnostic accuracy of physicians, but conditions for successful LLM assistance remain unclear. Further clinical evidence is needed to guide safe and effective integration into practice.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.200 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.051 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.416 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.410 Zit.