Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Performance Assessment of ChatGPT for the Board Qualification Examination of the Japanese Society for Oral and Maxillofacial Radiology
1
Zitationen
10
Autoren
2025
Jahr
Abstract
Abstract The aim of this study is to assess the performance and utility of ChatGPT for the board qualification examination of the Japanese Society for Oral and Maxillofacial Radiology (JSOMR). We assessed ChatGPT responses to 149 multiple-choice questions written in Japanese for the board qualification examination of the JSOMR for the 3 years from 2020 to 2022. The questions were directly entered into ChatGPT-3.5 and ChatGPT-4 models manually one by one as a prompt. The accuracy rate was calculated and classified by year, type of multiple-choice question, and level of intellectual ability, and significant differences were noted. The accuracy rate of GPT-3.5 for the 3 years was 45.0% (51.0% for 2020, 34.0% for 2021, and 50.0% for 2022), while the accuracy rate of GPT-4 was 68.5% (73.5% for 2020, 62.0% for 2021, and 70.0% for 2022) for the board qualification examination of the JSOMR. GPT-4 had a significantly higher accuracy rate than GPT-3.5 in each year. On performance classified by the type of multiple-choice questions, GPT-4 performed significantly better than GPT-3.5. However, neither model performed well with questions that required interpretation or knowledge of Japanese law. The performance of GPT-4 was significantly superior to GPT-3.5 in the board qualification examination of the JSOMR, suggesting that the use of Chat GPT, especially ChatGPT-4, would be effective as a tool for learning and preparing for the examination.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.100 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.466 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.