OpenAlex · Updated hourly · Last updated: 12 May 2026, 23:37

This is an overview page with metadata for this scholarly work. The full article is available from the publisher.

AI-driven rehabilitation: evaluation of ChatGPT-4o for generating personalized physical rehabilitation plans in comorbid patients

2025 · 1 citation · Wiadomości Lekarskie · Open Access
Open full text at the publisher

Citations: 1 · Authors: 5 · Year: 2025

Abstract

Aim: To evaluate the performance of ChatGPT-4o in creating personalized physical rehabilitation plans for comorbid patients.

Materials and Methods: ChatGPT-4o was used to generate physical rehabilitation plans for 50 clinical cases of comorbid patients. Two experts independently evaluated these plans against 6 criteria using a 5-point Likert scale. The experts also classified each plan by its suitability for use into 3 categories: "Completely unsuitable for use", "Suitable for use with corrections", and "Completely suitable for use". Statistical analysis included the Mann-Whitney U test, the intraclass correlation coefficient (ICC), and linear weighted Cohen's kappa (kw). Statistical significance was set at p < 0.05.

Results: The overall mean score of the ChatGPT-4o-generated rehabilitation plans was 4.30±0.28, with the highest scores for respiratory and musculoskeletal pathology (4.37±0.36 and 4.33±0.24, respectively). Among the evaluation criteria, the highest scores were observed for Clinical accuracy and Safety (4.59±0.59 and 4.41±0.71, respectively). Of the generated plans, 72.00% were classified as "Suitable for use with corrections". None of the plans was classified as "Completely unsuitable for use". The agreement percentage ranged from 84% to 90%, ICC values were 0.80-0.86, and the overall suitability kw was 0.77.

Conclusions: LLM-generated rehabilitation plans show promise as supportive tools in clinical practice, but they are not yet at a stage where they can be implemented without expert review and modification. The high overall inter-rater reliability provides confidence in the evaluation process, while also highlighting areas for improvement in both the LLM's performance and the assessment methodology.
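The abstract reports inter-rater agreement as a percentage and as linear weighted Cohen's kappa over the three suitability categories. As a rough illustration of how such figures can be computed, here is a minimal Python sketch. It is not the authors' analysis code; the function names and the toy ratings are hypothetical, and the study's own data and software are not given on this page.

```python
from collections import Counter


def linear_weighted_kappa(rater_a, rater_b, n_categories):
    """Linear weighted Cohen's kappa for two raters.

    Ratings are ordinal category indices 0..n_categories-1, e.g.
    0 = completely unsuitable, 1 = suitable with corrections,
    2 = completely suitable (an assumed encoding).
    """
    if n_categories < 2:
        raise ValueError("need at least two categories")
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")

    n = len(rater_a)
    joint = Counter(zip(rater_a, rater_b))  # observed pair counts
    marg_a = Counter(rater_a)               # per-rater category counts
    marg_b = Counter(rater_b)

    observed = 0.0  # weighted observed disagreement
    expected = 0.0  # weighted disagreement expected by chance
    for i in range(n_categories):
        for j in range(n_categories):
            # Linear weight: disagreement grows with category distance.
            weight = abs(i - j) / (n_categories - 1)
            observed += weight * joint[(i, j)] / n
            expected += weight * (marg_a[i] / n) * (marg_b[j] / n)

    if expected == 0:
        raise ValueError("kappa is undefined when both raters use one category")
    return 1.0 - observed / expected


def percent_agreement(rater_a, rater_b):
    """Share of cases (in percent) where both raters chose the same category."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)


# Hypothetical ratings for four plans, NOT data from the study.
expert_1 = [0, 1, 2, 2]
expert_2 = [0, 1, 2, 1]
print(round(linear_weighted_kappa(expert_1, expert_2, 3), 3))
print(percent_agreement(expert_1, expert_2))
```

Linear weights penalize a one-category disagreement half as much as a two-category disagreement, which suits an ordered scale like the suitability classification.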

Topics

Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Digital Mental Health Interventions