Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Comparative performance of large language models in answering periodontology questions from the Turkish Dental Specialty Examination: a cross-sectional study on accuracy and coverage

2025·3 Zitationen·BMC Oral HealthOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2025

Jahr

Abstract

Among the tested LLMs, ChatGPT-4 consistently outperformed others in accuracy, while DeepSeek-R1 and Gemini demonstrated moderate performance and Claude lagged behind. Accuracy was lower in clinical questions, reflecting the contextual complexity of clinical reasoning. Coverage scores did not differ significantly, indicating broadly similar comprehensiveness of responses.

Autoren

Institutionen

Dicle University(TR)

Themen

Artificial Intelligence in Healthcare and EducationClinical Reasoning and Diagnostic SkillsExplainable Artificial Intelligence (XAI)

Volltext beim Verlag öffnen

Comparative performance of large language models in answering periodontology questions from the Turkish Dental Specialty Examination: a cross-sectional study on accuracy and coverage

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen