Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Performance of Publicly Available Large Language Models on Internal Medicine Board-style Questions
0
Zitationen
6
Autoren
2023
Jahr
Abstract
<title>Abstract</title> We investigate the performance of four large language models (LLMs) on internal medicine board-style examination questions from the Medical Knowledge Self-Assessment Program released by the American College of Physicians. GPT4 outperformed GPT3.5, human users, LaMDA and LLaMA 2 in that order. A drop in GPT4 performance when accessing its API versus its publicly available chatbot (ChatGPT) was recovered through fine tuning in the form of Harrison’s Principles of Internal Medicine.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.102 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.468 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.