Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Exploring ChatGPT's Aptitude in Essential Concepts of Hypertension
0
Zitationen
6
Autoren
2023
Jahr
Abstract
Background: ChatGPT is a state-of-the-art language model with human-like response generation capacity for various tasks. While there are debates about the possibility of ChatGPT replacing clinicians in clinical settings, its competence in nephrology, specifically in hypertension, remains uncertain. This study aims to assess ChatGPT's proficiency in addressing fundamental queries related to the diagnosis, treatment, and management of hypertension. Methods: Using the Nephrology Self-Assessment Program (NephSAP) issues 2016-2022: V15N1, V17N1, V19N1, V21N4 from the American Society of Nephrology, we conducted a rigorous evaluation of ChatGPT's accuracy in answering questions related to hypertension. We excluded questions containing images due to ChatGPT's current limitations in image processing. The analysis included 95 questions from NephSAP. Each question set was executed 3 times using ChatGPT (version Mar 14, OpenAI), and we determined the level of agreement between the initial and subsequent attempts, conducted 2 weeks apart. Results: Our analysis revealed that ChatGPT achieved accuracies of 65.5% on first attempt, and 76.4 and 78.1 % on second and on third attempts, respectively, for the NephSAP questions. We noted that ChatGPT had a higher level of correct answers compared to incorrect ones, and it improved its knowledge after every attempt (table 1). Conclusions: Our findings indicate that ChatGPT's accuracy in addressing core concepts related to hypertension management falls below the minimum passing threshold of 75% established by the ASN for nephrologists, with an initial accuracy rate of 65.5%. This emphasizes the need for further development and training to improve ChatGPT's accuracy and consistency in the area of hypertension. Our study's outcomes have significant implications for ChatGPT's potential use as an educational tool for clinicians, highlighting the importance of ongoing research and development to broaden its proficiency in clinical subspecialties. Accuracy of ChapGPT on Hypertension Questions* Questions 1-25
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.316 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.177 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.575 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.468 Zit.