This is an overview page with metadata for this scientific work. The full article is available from the publisher.
Assessing LLMs on IDSA Practice Guidelines for the Diagnosis and Treatment of Native Vertebral Osteomyelitis: A Comparison Study
Citations: 1
Authors: 9
Year: 2025
Abstract
<b>Background</b>: Native vertebral osteomyelitis (NVO) presents diagnostic and therapeutic challenges requiring adherence to complex clinical guidelines. The emergence of large language models (LLMs) offers new avenues for real-time clinical decision support, yet their utility in managing NVO has not been formally assessed. <b>Methods</b>: This study evaluated four LLMs (Consensus, Gemini, ChatGPT-4o Mini, and ChatGPT-4o) using 13 standardized questions derived from the 2015 IDSA guidelines. Each model generated 13 responses (n = 52 in total), which were independently assessed by three orthopedic surgeons for accuracy (4-point scale) and comprehensiveness (5-point scale). <b>Results</b>: ChatGPT-4o produced the longest responses (428.0 ± 45.4 words), followed by ChatGPT-4o Mini (392.2 ± 97.4), Gemini (358.2 ± 60.5), and Consensus (213.2 ± 68.8). For accuracy, ChatGPT-4o and Gemini achieved the highest proportions of "Excellent" ratings (54% and 51%, respectively), while Consensus received only 20%. Comprehensiveness scores mirrored this trend, with ChatGPT-4o (3.95 ± 0.79) and Gemini (3.82 ± 0.68) significantly outperforming Consensus (2.87 ± 0.66). Domain-specific analysis revealed that ChatGPT-4o achieved a 100% "Excellent" accuracy rating on therapy-related questions. Statistical analysis confirmed significant inter-model differences (<i>p</i> < 0.001). <b>Conclusions</b>: Advanced LLMs, especially ChatGPT-4o and Gemini, demonstrated high accuracy and depth in interpreting clinical guidelines for NVO. These findings highlight their potential as effective tools for augmenting evidence-based decision-making and improving consistency in clinical care.
Related works
Projections of Primary and Revision Hip and Knee Arthroplasty in the United States from 2005 to 2030
2007 · 6,866 citations
Traumatic Arthritis of the Hip after Dislocation and Acetabular Fractures
1969 · 5,600 citations
Projections of Primary and Revision Hip and Knee Arthroplasty in the United States from 2005 to 2030
2007 · 5,398 citations
Traumatic Arthritis of the Hip After Dislocation and Acetabular Fractures: Treatment by Mold Arthroplasty: An End-Result Study Using a New Method of Result Evaluation
2013 · 5,081 citations
2015 ESC Guidelines for the management of infective endocarditis
2015 · 4,901 citations