Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Authors’ Reply: Critical Limitations in Comparing ChatGPT and DeepSeek for Orthopedic Assessment

2026·0 Zitationen·JMIR Formative ResearchOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2026

Jahr

Abstract

We respond to comments on our study comparing ChatGPT and DeepSeek for answering orthopedic multiple-choice questions. We clarify that the reported Cohen κ values reflect inter-rater reliability within each model rather than agreement between the two models. All questions were administered in English, and the findings therefore reflect performance in an English-language context. We acknowledge limitations related to reproducibility due to the use of web-based interfaces and address concerns about data contamination. We also correct a typographical error in the reported accuracy for the pelvic and spine injury category.

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationClinical Reasoning and Diagnostic SkillsRadiomics and Machine Learning in Medical Imaging

Volltext beim Verlag öffnen

Authors’ Reply: Critical Limitations in Comparing ChatGPT and DeepSeek for Orthopedic Assessment

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen