Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Advancing AI in oncology: A performance comparison of ChatGPT-4o and ChatGPT-o1 in neuroendocrine tumor clinical decision making.
0
Zitationen
7
Autoren
2025
Jahr
Abstract
e13730 Background: Artificial intelligence (AI) is increasingly integrated into clinical decision-making, offering potential support in complex oncologic management. While ChatGPT (developed by OpenAI, San Francisco, CA) has demonstrated utility in various medical applications, its ability to accurately navigate guideline-based recommendations for neuroendocrine tumors (NETs) remains unexplored. Objective: This study aimed to assess the comparative performance of GPT-4o and GPT-o1 in providing guideline-based recommendations for NET management, utilizing the National Comprehensive Cancer Network (NCCN) guidelines as a reference standard. Methods: A total of 43 clinical questions were systematically derived from the NCCN guidelines, covering key decision-making domains: treatment, surveillance, and diagnostics. Both GPT-4o and GPT-o1 were queried, with responses independently evaluated by two physicians using a Likert scale (1-5), where: 5 = Fully correct, 4 = Mostly correct (minor omissions), 3 = Partially correct (lacking completion), 2 = Partially incorrect, 1 = Completely incorrect. Results: To explore the interpretative difference of ChatGPT-4o and ChatGPT-o1 on neuroendocrine tumor management, a Mann-Whitney U test was conducted. The analysis was statistically borderline (p = .050). The Mean Rank of ChatGPT-o1 was greater than that of ChatGPT-4o, thus GPT-o1 can be considered superior to GPT-4o. (Mann-Whitney U-test; Mean Rank = 47.16 vs 39.84) When stratified by question category, no statistically significant differences were observed in diagnostics, treatment, or surveillance. Conclusions: ChatGPT-o1 exhibited marginal improvements in the decision making for NET management while maintaining comparable performance to ChatGPT-4o in other domains. These findings suggest that iterative advancements in AI models may enhance their ability to support evidence-based clinical decision-making. Further studies are warranted to validate these findings in real-world oncologic practice.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.231 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.084 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.444 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.423 Zit.