Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluating Large Language Models for Mild Cognitive Impairment: A Bilingual Comparison of ChatGPT, Gemini, and Kimi
0
Zitationen
7
Autoren
2025
Jahr
Abstract
Abstract Background Mild Cognitive Impairment (MCI) is a key stage between normal aging and Alzheimer’s Disease (AD), with early intervention crucial for slowing progression. Large Language Models (LLMs) offer promising support by providing accessible, evidence-based information for non-specialist healthcare professionals and care partners. However, concerns about accuracy and limited multilingual evaluations remain. Objective This study explores the potential of LLMs in managing MCI, examines their support for non-specialist healthcare professionals and care partners, and compares English and Chinese responses to MCI-related queries, considering language-specific nuances and effectiveness. Methods We submitted 72 open-ended questions related to MCI management to ChatGPT-4o, Gemini, and Kimi, assessing their responses based on accuracy, comprehensibility, specificity, and actionability using a five-point Likert scale. Statistical analyses, including Intraclass Correlation Coefficients and Mann-Whitney U tests, were conducted to examine response across models. Result LLMs’ performance in MCI management was evaluated, with the Symptoms and Diagnosis domain scoring highest. Healthcare professionals’ needs were better met than care partners’, particularly in accuracy, comprehensibility, and actionability. English responses outperformed Chinese in comprehensibility and specificity. Conclusion Based on the results, LLMs demonstrate potential in assisting non-specialist healthcare professionals and care partners, particularly in the domains of symptoms and diagnosis. However, there is a need for further optimization in Chinese medical corpora, as English responses outperformed Chinese ones due to corpus disparities. Tailored models for care partners, focusing on reliability and clarity, are essential to address their unmet needs and enhance their experience in managing MCI.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.200 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.051 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.416 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.410 Zit.