Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
ChatGPT-Exploring Its Role in Clinical Chemistry.
1
Zitationen
7
Autoren
2023
Jahr
Abstract
OBJECTIVE: To evaluate the utility of artificial intelligence-powered language models (ChatGPT 3.5 and GPT-4) compared to trainees and clinical chemists in responding to common laboratory questions in the broad area of Clinical Chemistry. METHODS: 35 questions from real-life case scenarios, clinical consultations, and clinical chemistry testing questions were used to evaluate ChatGPT 3.5, and GPT-4 alongside clinical chemistry trainees (residents/fellows) and clinical chemistry faculty. The responses were scored based on category and based on years of experience. RESULTS: The Senior Chemistry Faculty demonstrated superior accuracy with 100% of correct responses compared to 90.5%, 82.9%, and 71.4% of correct responses from the junior chemistry faculty, fellows, and residents respectively. They all outperformed both ChatGPT 3.5 and GPT-4 which generated 60% and 71.4% correct responses respectively. Of the sub-categories examined, ChatGPT 3.5 achieved 100% accuracy in endocrinology while GPT-4 did not achieve 100% accuracy in any subcategory. GPT-4 was overall better than ChatGPT 3.5 by generating similar correct responses as residents (71.4%) but performed poorly to human participants when both partially correct and incorrect indices were considered. CONCLUSION: Despite all the advances in AI-powered language models, ChatGPT 3.5 and GPT-4 cannot replace a trained pathologist in answering clinical chemistry questions. Caution should be observed by people, especially those not trained in clinical chemistry, to interpret test results using chatbots.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.646 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.554 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.071 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.851 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.