Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluation of YouTube Exercise Videos for Fall Prevention in Older Adults: ChatGPT-4.5 Versus Human Experts
0
Zitationen
5
Autoren
2025
Jahr
Abstract
Objective: This study aims to determine the potential and limitations of artificial intelligence (AI) in this field by comparing the results of ChatGPT-4.5 and experts in the evaluation of YouTube videos intended for older adults. Materials and Methods: A search was conducted on YouTube using the keyword “fall prevention exercises for elderly,” and the 100 most viewed videos were examined. Of these, 64 videos that met the criteria were included in the study. The comprehensiveness, quality [global quality scale (GQS)], and reliability [Quality Criteria for Consumer Health Information (DISCERN)] of the videos were evaluated by two independent physiotherapists and ChatGPT-4.5. Agreement between the evaluations was tested using Wilcoxon signed-rank test, intraclass correlation coefficient (ICC), and Bland-Altman analyses. Results: No significant differences were found between ChatGPT-4.5 and human experts in terms of comprehensiveness (p=0.242) and GQS (p=0.083) scores, and a high level of agreement was observed (ICC 0.932 and 0.876, respectively). However, in DISCERN scores, ChatGPT-4.5 awarded significantly higher scores than the human experts (p=0.005), and the level of agreement was determined to be excellent (ICC=0.942). Nevertheless, a wide range of differences (limits of agreement: -4.9 to 7.18) was identified. Conclusion: ChatGPT-4.5 can be used as a reliable assessment tool in determining the comprehensiveness and quality levels of fall prevention exercise videos. However, it was concluded that in reliability scoring, AI should be used under expert supervision.
Ähnliche Arbeiten
Structural Equation Modelling: Guidelines for Determining Model Fit
2008 · 8.733 Zit.
Determination and Quantification Of Content Validity
1986 · 6.158 Zit.
The content validity index: Are you sure you know what's being reported? critique and recommendations
2006 · 6.028 Zit.
Health literacy and public health: A systematic review and integration of definitions and models
2012 · 5.779 Zit.
Is the CVI an acceptable indicator of content validity? Appraisal and recommendations
2007 · 4.709 Zit.