Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
ChatGPT's Performance on Iran's Medical Licensing Exams
10
Zitationen
12
Autoren
2023
Jahr
Abstract
Abstract Background: A 175 billion parameter Transformer architecture is used by OpenAI's ChatGPT language model to perform tasks requiring natural language processing. This study aims to evaluate the knowledge and interpretive abilities of ChatGPT on three types of Iranian medical license exams: basic sciences, pre-internship, and pre-residency. Methods: The study involved administering three different levels of Iran's medical license exams, which included basic sciences, pre-internship, and pre-residency, to Chat GPT 3.5. Two versions of each exam were used, corresponding to the ChatGPT 3.5's internet access time: one during the access time and one after. These exams were input to Chat GPT in Persian and English languages without any justifications. The accuracy and concordance of each question were extracted by two blinded adjudicators. Results: A total of 2210 questions, including 667 basic sciences, 763 pre-internship, and 780 pre-residency questions, were presented to ChatGPT in both English and Persian languages. Across all tests, the overall accuracy was found to be 48.5%, with an overall concordance of 91%. Notably, English questions exhibited higher accuracy and concordance rates, with 61.4% accuracy and 94.5% concordance, compared to 35.7% accuracy and 88.7% concordance for Persian questions. Conclusion: Our findings demonstrate that ChatGPT performs above the required passing score on the three categories of exam questions. Moreover, the model showcases its ability to provide reasoning and contextual information in the majority of responses, owing to its dialogic character when addressing inquiries. These results provide compelling evidence for the potential use of ChatGPT in medical education.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.102 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.468 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.