Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluation of Large Language Models in Medical Examinations: A Scoping Review Protocol
0
Zitationen
5
Autoren
2025
Jahr
Abstract
INTRODUCTION: Standardized medical examinations, used to assess trainee clinical competencies, provide a rigorous means to verify LLM accuracy and reliability in medical contexts. Although current evaluations use these exams to test LLMs' clinical reasoning, significant performance variations occur across different clinical scenarios. Existing methods struggle to adapt to evolving research needs. This study synthesizes prior research on LLMs in medical exams, highlighting current limitations and proposing future research directions. METHODS AND ANALYSIS: The formulation of the protocol was guided by the standards set forth in the JBI Manual for Evidence Synthesis. Following the establishment of precise inclusion/exclusion criteria and search strategies, we will execute systematic searches in the PubMed and Web of Science Core Collection databases. The method encompasses literature review, data extraction, analytical frameworks, and process mapping. By employing this method, researchers maintain methodological rigor during the entire research process. ETHICS AND DISSEMINATION: This protocol describes a method for performing a scoping review. The investigation focuses on the organized synthesis and examination of previously published research. It does not include human/animal experimentation or sensitive data collection. Ethical approval is not required for this literature-based study.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.551 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.443 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.942 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.792 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
Institutionen
- Changchun University of Chinese Medicine(CN)
- Beijing University of Chinese Medicine(CN)
- Dongzhimen Hospital Affiliated to Beijing University of Chinese Medicine(CN)
- Chinese Academy of Medical Sciences & Peking Union Medical College(CN)
- Hunan Academy of Traditional Chinese Medicine(CN)
- China Academy of Chinese Medical Sciences(CN)
- Peking Union Medical College Hospital(CN)
- Leipzig University(DE)
- Peking University(CN)