OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 20.03.2026, 05:21

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Can AI outperform professional writers in summarizing foot and ankle literature?

2025·0 Zitationen·Foot & Ankle Surgery Techniques Reports & CasesOpen Access
Volltext beim Verlag öffnen

0

Zitationen

2

Autoren

2025

Jahr

Abstract

<h2>Abstract</h2> This study evaluates the performance of an advanced large language model in summarizing scientific literature within the specialized field of foot and ankle surgery. Building upon prior work that demonstrated ChatGPT-3.5′s comparability to podiatric residents, this investigation compares ChatGPT-4.5 directly against paid, professionally written summaries sourced from Foot and Ankle Quarterly. Ten original research articles were summarized by ChatGPT-4.5 and matched with corresponding professionally written summaries. Quantitative analysis using BLEU and ROUGE metrics assessed textual similarity, while Flesch Reading Ease and Flesch-Kincaid Grade Level scores evaluated readability. A qualitative preference survey was conducted among three blinded, fellowship-trained foot and ankle surgeons. Results showed that AI-generated summaries were preferred in 73.33% of comparisons and demonstrated no factual inaccuracies. Although professionally written summaries were quantitatively more readable, AI-generated summaries maintained higher consistency in language complexity. ROUGE scores suggested substantial content overlap between AI-generated and reference summaries, whereas BLEU scores reflected differences, which may be attributable to shorter AI summary lengths. These findings suggest ChatGPT-4.5 can reliably and efficiently produce accurate, high-quality summaries, potentially surpassing paid academic writers in certain domains. Broader implications include improved efficiency in academic research and literature review. Continued investigation and oversight are necessary to guide the responsible integration of AI tools into clinical and scholarly workflows. <h3>Level of Evidence</h3> III, comparative study

Ähnliche Arbeiten