Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
P0678 Therapeutic Decisions in Inflammatory Bowel Disease: ChatGPT-4.0 Compared to Medical Expertise
0
Zitationen
4
Autoren
2026
Jahr
Abstract
Abstract Background With the increasing prevalence of inflammatory bowel disease (IBD) and its complex therapeutic management, evaluating innovative approaches has become essential. Large language models (LLM), such as ChatGPT-4.0, have experienced remarkable growth recently. While several studies have assessed LLMs across different specialties, their application in IBD management remains unexplored. This study compares therapeutic recommendations generated by ChatGPT-4.0 with those made by gastroenterologist IBD specialists, evaluating its potential applications and limitations in clinical practice. Methods This prospective observational study was conducted at a University Hospital between September 2024 and March 2025, including 47 cases of IBD patients requiring initiation or modification of biologic therapy. Standardized clinical vignettes were developed to provide both the IBD specialists and ChatGPT-4.0 with similar information concerning patients’ presentations. For each case, questions were addressed in a consistent sequence: the choice of the therapeutic class of the treatment, followed by the specific molecule in this class, and finally if the proposition is a monotherapy or combination therapy. The propositions were compared using the percentage of agreement and the Cohen’s kappa coefficient (κ). Results The agreement between the IBD specialists and ChatGPT-4.0 was 76.6% (κ = 0.677) for the therapeutic class and 44.7% (κ = 0.352) for specific molecule selection. Therapeutic class discordance occurred in 11 cases (23.4%). All discordant cases were reviewed by the gastroenterologists and deemed acceptable choices. Among these, seven second-line suggestions made by ChatGPT-4.0 aligned with the IBD specialists’ treatment decisions. Conclusion This observational prospective study highlights a promising area of ChatGPT-4.0 application in the therapeutic management of IBD. However, it is not suitable as a standalone or fully reliable resource for decision-making in healthcare. Future efforts should focus on developing tailored healthcare GPT that incorporates European medical guidelines and experts’ feedback to further improve LLM performance. Conflict of interest: Dr. Loncour, Helena: No conflict of interest Aoun, Jennifer: No conflicts Muls, Vinciane: No conflict of interest Hoyois, Alice: Sponsored invitation to congress: Abbvie
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.100 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.466 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.