Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Comparative Evaluation of Teaching Plans on Prostate Cancer Generated by Various Large Language Models and a Human Expert
0
Zitationen
6
Autoren
2025
Jahr
Abstract
ABSTRACT Prostate cancer remains one of the most common malignancies affecting men globally, characterized by high morbidity and mortality rates. The complexity and variability of the disease necessitate diverse treatment strategies, ranging from active surveillance to more aggressive interventions such as radical prostatectomy, radiation therapy, and androgen deprivation therapy. This study investigates the potential of large language models (LLMs) in generating educational content for prostate cancer, focusing on the creation of teaching plans in both Chinese and English. Four LLMs—GPT‐4 (OpenAI), Gemini 1.5 Pro (Google), Kimi AI (Microsoft), and Douban (ByteDance)—were evaluated against teaching plans developed by an experienced urology professor. A double‐blind assessment by 25 urology faculty members using a standardized 10‐point scale was employed to compare the quality of curriculum content, learning objectives, and outcomes. The results revealed that GPT‐4 and Gemini 1.5 Pro outperformed Kimi AI and Douban, yet still lagged behind human‐generated plans, particularly in Chinese. Statistical analyses indicated significant differences in the quality scores among the LLMs and the human experts, underscoring the necessity of integrating domain‐specific knowledge into AI‐generated content. This research highlights the promise and limitations of LLMs in medical education, suggesting that future developments should focus on hybrid models that combine artificial intelligence with human expertise to enhance educational efficacy.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.292 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.143 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.539 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.452 Zit.