Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A comparative evaluation of two large language models in pediatric dentistry
0
Zitationen
1
Autoren
2026
Jahr
Abstract
In recent years, artificial intelligence and large language models (LLMs) have been developed rapidly. This study evaluated the accuracy, comprehensiveness, and performance of two LLMs in answering questions about pediatric dentistry. ChatGPT-3.5 and Gemini-1.5pro were tested on 45 questions about pediatric dentistry. These questions are classified as multiple-choice questions MCQ), True/false questions, and open-ended questions. They were directed to LLMs. Responses were recorded and scored by four pediatric dentists. Statistical analyses, including ICC analysis, were performed to determine the agreement and accuracy of the responses. The significance level was set as p < 0.050. Gemini was statistically significantly more accurate in 71.1% of the questions (p = 0.001). This difference was due to the T/F section (p = 0.001). ChatGPT gave more correct answers to the MCQ. There was no significant difference between the responses of the Gemini model to different types of questions prepared in the field of pediatric dentistry and the median scores (p = 0.062). The performance of LLMs in pediatric dentistry was satisfactory. However, further training using specific, relevant data derived from reliable sources is required. Additionally, the validity of these chatbots’ responses must be meticulously verified. LLMs provide clinical support to the pediatric dentist by providing the right information quickly, so they can help him/her to perform the most appropriate treatment in the shortest possible time and thus ensure child-patient cooperation.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.102 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.468 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.