Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study (Preprint)

2023·0 ZitationenOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2023

Jahr

Abstract

<sec> <title>BACKGROUND</title> The competence of ChatGPT (Chat Generative Pre-Trained Transformer) in non-English languages is not well studied. </sec> <sec> <title>OBJECTIVE</title> This study compared the performances of GPT-3.5 (Generative Pre-trained Transformer) and GPT-4 on the Japanese Medical Licensing Examination (JMLE) to evaluate the reliability of these models for clinical reasoning and medical knowledge in non-English languages. </sec> <sec> <title>METHODS</title> This study used the default mode of ChatGPT, which is based on GPT-3.5; the GPT-4 model of ChatGPT Plus; and the 117th JMLE in 2023. A total of 254 questions were included in the final analysis, which were categorized into 3 types, namely general, clinical, and clinical sentence questions. </sec> <sec> <title>RESULTS</title> The results indicated that GPT-4 outperformed GPT-3.5 in terms of accuracy, particularly for general, clinical, and clinical sentence questions. GPT-4 also performed better on difficult questions and specific disease questions. Furthermore, GPT-4 achieved the passing criteria for the JMLE, indicating its reliability for clinical reasoning and medical knowledge in non-English languages. </sec> <sec> <title>CONCLUSIONS</title> GPT-4 could become a valuable tool for medical education and clinical support in non–English-speaking regions, such as Japan. </sec>

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationRadiomics and Machine Learning in Medical Imaging

Volltext beim Verlag öffnen

Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study (Preprint)

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen