This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Comparison of Grok and ChatGPT in Temporomandibular Joint Magnetic Resonance Images Interpretation: Sectional Study
Citations: 0
Authors: 2
Year: 2026
Abstract
Objective: With the rapid development of artificial intelligence, large language models (LLMs) have emerged as tools capable of processing and interpreting complex medical data. These models can assist in the interpretation of imaging data, especially in conditions that require detailed anatomical analysis, such as temporomandibular joint disorders. This study evaluated and compared the performance of two LLMs, Chat Generative Pre-trained Transformer-4 Omni (ChatGPT-4o) and Grok, in diagnosing temporomandibular joint disc displacement from magnetic resonance imaging (MRI). Material and Methods: A total of 129 sagittal MRI images, including T1- and T2-weighted sequences, were retrospectively analyzed. The images were annotated to identify the disc and mandibular condyle, with diagnoses confirmed by oral and maxillofacial radiology experts. Both models were tasked with identifying anatomical structures and assessing the disc-condyle relationship. Results: Among the analyzed images, 65 showed disc displacement and 64 did not. ChatGPT-4o achieved an overall diagnostic accuracy of 67.4%, with a sensitivity of 100% but lower specificity and precision. In contrast, Grok demonstrated an accuracy of 49.7% (p<0.005), but outperformed ChatGPT-4o in specificity (76.9%), precision (61.5%), and F1-score (58.1%). While ChatGPT-4o identified all pathological cases, Grok was more balanced in limiting false positives. Conclusion: This study highlights the potential of LLMs as supplementary tools in oral and maxillofacial radiology while emphasizing the need for further advancements to improve their diagnostic capabilities.
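The metrics named in the Results (accuracy, sensitivity, specificity, precision, F1-score) all follow from the four cells of a binary confusion matrix. The following is a minimal sketch of those standard definitions, not the authors' code. The function name `confusion_metrics` is hypothetical, and the example counts are an assumption: they are back-calculated from the reported class sizes (65 displaced, 64 not) together with ChatGPT-4o's reported 100% sensitivity and 67.4% accuracy, and are not stated in the abstract.

```python
import math


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def confusion_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Standard binary-classification metrics from confusion-matrix counts.

    tp: displaced discs called displaced    fp: normal discs called displaced
    tn: normal discs called normal          fn: displaced discs called normal
    """
    return {
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        "sensitivity": _ratio(tp, tp + fn),   # recall on displaced cases
        "specificity": _ratio(tn, tn + fp),   # recall on non-displaced cases
        "precision": _ratio(tp, tp + fp),
        "f1": _ratio(2 * tp, 2 * tp + fp + fn),
    }


# ASSUMPTION: counts inferred from the abstract, not reported in it.
# 100% sensitivity on 65 displaced images implies tp=65, fn=0;
# 67.4% accuracy on 129 images implies 87 correct, so tn = 87 - 65 = 22
# and fp = 64 - 22 = 42.
inferred = confusion_metrics(tp=65, fp=42, tn=22, fn=0)
print(f"accuracy:    {inferred['accuracy']:.1%}")
print(f"sensitivity: {inferred['sensitivity']:.1%}")
```

Because sensitivity only looks at the displaced cases, a model that labels nearly every image as displaced can reach 100% sensitivity while its specificity and precision stay low, which is the trade-off the Results describe.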