This is an overview page with metadata for this scientific paper. The full article is available from the publisher.

Assessing accuracy and legitimacy of multimodal large language models on Japan Diagnostic Radiology Board Examination

2025 · 5 citations · Japanese Journal of Radiology · Open Access

Abstract

Eight multimodal large language models (LLMs) were evaluated on the Japan Diagnostic Radiology Board Examination (JDRBE). OpenAI's o3 and Google DeepMind's Gemini 2.5 Pro achieved high accuracy (72% and 70%, respectively) and received good legitimacy scores from human raters. These results reflect the rapid progress of recent multimodal LLMs in diagnostic radiology.

Topics

Radiology practices and education · Reliability and Agreement in Measurement · Artificial Intelligence in Healthcare and Education
