Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
SUN-809 Can AI Chatbots Provide Accurate Information About Thyroid Eye Disease?
0
Zitationen
10
Autoren
2025
Jahr
Abstract
Abstract Disclosure: H. Lee: None. A. Shams: None. G. Wu: None. S. Sidhu: None. A. Sidhu: None. E. Pan: None. A. Ashok: None. P. Badala: None. B. Hoang: None. M. Del Buono: None. Background: Thyroid eye disease affects about 0.25% of people and is more common in women (16 per 100,000) than men (2.9 per 100,000).In patients with Graves’ Disease, the incidence of Thyroid Eye Disease can be between 25% and 40%. Patients who develop thyroid eye disease, additionally, have an elevated risk of developing other ocular symptoms such as dry eye disease. Purpose: Evaluate the ability and accuracy of AI models in correctly diagnosing and providing information about thyroid eye disease. Methods: Questions were asked in English to Claude, Cohere, Gemini, GPT 4o Mini, and GPT 4o.Textual responses from Claude, Cohere, Gemini, GPT 4o Mini, and GPT 4o were recorded and translated with help from native speakers. Manual and AI Scores were rated on a scale from 1 to 5, 5 being the most accurate response with 1 being the least accurate response. A series of paired T-tests were used to track the difference between scores, with score difference being calculated as the AI self-score subtracted from the manual score. Results: Our R-value, or Pearson’s Correlation Coefficient, of 0.505 indicates a mild positive correlation between the manual and AI Scores across the LLMs. Therefore, both of the manual and AI scores amongst the various chatbots are fairy accurate. The manual scores were not significantly different among the different LLMs (one-way ANOVA: F(4,15) = 2.28, p = 0.108). AI rated scores and manual scores relatively align among different chatbots, suggesting that chatbots were accurate in assessing the quality of their responses. Conclusion: There is a need for further training of these LLMs on more diverse datasets when queried about less common diseases. Presentation: Sunday, July 13, 2025
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.260 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.116 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.493 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.438 Zit.