Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Performance of Large Language Models in Lung Cancer Clinical Decision-Making: A Comparative Analysis Based on DeepSeek, Grok, and GPT
0
Zitationen
4
Autoren
2025
Jahr
Abstract
Large language models (LLMs) have reached a breakthrough in many aspects of imaging analysis and guideline mining, but not enough research has been conducted on applying them specifically to lung cancer applications. Three models were chosen, and corresponding questions that highlighted specificity towards the diagnosis of lung cancer were proposed with the goal of providing data to increase confidence and improve recommendations for transforming AI-driven clinical care for lung cancer. In this study, five clinical domains were defined. Each question was individually uploaded to the models, and responses were evaluated by three thoracic surgery experts based on accuracy, completeness, and practicality. DeepSeek-R1, Grok-3, and GPT-4.5 showed different levels of results when it came to providing clinical support for lung cancer. Regarding their responses to clinical questions, Grok-3 had a much longer average response and better performance scores. Subgroup analyses further showed that Grok-3 scored the highest of all five domains. Additionally, the confidence scores on recognizing images in the text of Grok-3 were the highest; most mistakes occurred in the differential diagnosis of special cases, whereas DeepSeek and GPT all gave preference to rarer or infectious diseases.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.316 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.177 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.575 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.468 Zit.