Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large Language Models in Healthcare: A Meta-Analysis for Clinical Decision-Making
0
Zitationen
7
Autoren
2025
Jahr
Abstract
This study presents a comprehensive evaluation of AI diagnostic models across diverse clinical cases sourced from Thieme (77 cases) and Elsevier (48 cases). The dataset spans frequent (42 cases), less frequent (44 cases), and rare (39 cases) conditions, ensuring balanced assessment. Diagnostic performance, treatment recommendation accuracy, and linguistic reliability were compared across multiple state-of-the-art AI systems. Results highlight strong overall diagnostic capabilities, with notable variations across specialties and disease frequencies. While Pediatrics consistently demonstrated the highest performance, Surgery emerged as the most challenging specialty. Among models, GPT-4o achieved superior diagnostic consensus, treatment recommendation accuracy, and linguistic precision, underscoring its clinical utility. The findings provide empirical benchmarks for advancing AI-based medical decision support systems.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.551 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.443 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.942 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.792 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.