Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

GPT versus Resident Physicians — A Benchmark Based on Official Board Scores

2024·124 Zitationen·NEJM AIOpen Access

Volltext beim Verlag öffnen

124

Zitationen

Autoren

2024

Jahr

Abstract

BACKGROUND Artificial intelligence (AI) is a burgeoning technological advancement, with considerable promise for influencing the field of medicine. As a preliminary step toward integrating AI into medical practice, it is imperative to ascertain whether model performance is comparable with that of physicians. We present a systematic comparison of performance by a large language model (LLM) versus that of a large cohort of physicians. The cohort includes all residents who took the medical specialist license examination in Israel in 2022 across the core medical disciplines: internal medicine, general surgery, pediatrics, psychiatry, and obstetrics and gynecology (OB/GYN). We provide the examinations as an accessible benchmark dataset for the medical machine learning and natural language processing communities, which may be adapted for future LLM studies.

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationCardiac, Anesthesia and Surgical OutcomesRadiology practices and education

Volltext beim Verlag öffnen

GPT versus Resident Physicians — A Benchmark Based on Official Board Scores

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen