Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Using Large Language Models in the Diagnosis of Acute Cholecystitis: Assessing Accuracy and Guidelines Compliance
1
Zitationen
9
Autoren
2025
Jahr
Abstract
BackgroundLarge language models (LLMs) are advanced tools capable of understanding and generating human-like text. This study evaluated the accuracy of several commercial LLMs in addressing clinical questions related to diagnosis and management of acute cholecystitis, as outlined in the Tokyo Guidelines 2018 (TG18). We assessed their congruence with the expert panel discussions presented in the guidelines.MethodsWe evaluated ChatGPT4.0, Gemini Advanced, and GPTo1-preview on ten clinical questions. Eight derived from TG18, and two were formulated by the authors. Two authors independently rated the accuracy of each LLM's responses on a four-point scale: (1) accurate and comprehensive, (2) accurate but not comprehensive, (3) partially accurate, partially inaccurate, and (4) entirely inaccurate. A third author resolved any scoring discrepancies. Then, we comparatively analyzed the performance of ChatGPT4.0 against newer large language models (LLMs), specifically Gemini Advanced and GPTo1-preview, on the same set of questions to delineate their respective strengths and limitations.ResultsChatGPT4.0 provided consistent responses for 90% of the questions. It delivered "accurate and comprehensive" answers for 4/10 (40%) questions and "accurate but not comprehensive" answers for 5/10 (50%). One response (10%) was rated as "partially accurate, partially inaccurate." Gemini Advanced demonstrated higher accuracy on some questions but yielded a similar percentage of "partially accurate, partially inaccurate" responses. Notably, neither model produced "entirely inaccurate" answers.DiscussionLLMs, such as ChatGPT and Gemini Advanced, demonstrate potential in accurately addressing clinical questions regarding acute cholecystitis. With awareness of their limitations, their careful implementation, and ongoing refinement, LLMs could serve as valuable resources for physician education and patient information, potentially improving clinical decision-making in the future.
Ähnliche Arbeiten
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries
2021 · 110.232 Zit.
Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries
2018 · 87.247 Zit.
Global cancer statistics
2011 · 54.999 Zit.
Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012
2014 · 28.930 Zit.
Global cancer statistics, 2012
2015 · 27.302 Zit.