Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Dermatology “AI Babylon”: Cross-Language Evaluation of AI-Crafted Dermatology Descriptions
0
Zitationen
14
Autoren
2026
Jahr
Abstract
<i>Background and Objectives</i>: Dermatology relies on a complex terminology encompassing lesion types, distribution patterns, colors, and specialized sites such as hair and nails, while dermoscopy adds an additional descriptive framework, making interpretation subjective and challenging. Our study aims to evaluate the ability of a chatbot (Gemini 2) to generate dermatology descriptions across multiple languages and image types, and to assess the influence of prompt language on readability, completeness, and terminology consistency. Our research is based on the concept that non-English prompts are not mere translations of the English prompts but are independently generated texts that reflect medical and dermatological knowledge learned from non-English material used in the chatbot's training. <i>Materials and Methods</i>: Five macroscopic and five dermoscopic images of common skin lesions were used. Images were uploaded to Gemini 2 with language-specific prompts requesting short paragraphs describing visible features and possible diagnoses. A total of 2400 outputs were analyzed for readability using LIX score and CLEAR (comprehensiveness, accuracy, evidence-based content, appropriateness, and relevance) assessment, while terminology consistency was evaluated via SNOMED CT mapping across English, French, German, and Greek outputs. <i>Results</i>: English and French descriptions were found to be harder to read and more sophisticated, while SNOMED CT mapping revealed the largest terminology mismatch in German and the smallest in French. English texts and macroscopic images achieved the highest accuracy, completeness, and readability based on CLEAR assessment, whereas dermoscopic images and non-English texts presented greater challenges. <i>Conclusions</i>: Overall, partial terminology inconsistencies and cross-lingual variations highlighted that the language of the prompt plays a critical role in shaping AI-generated dermatology descriptions.
Ähnliche Arbeiten
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.144 Zit.
Tumor Angiogenesis: Therapeutic Implications
1971 · 10.083 Zit.
Improved Survival with Vemurafenib in Melanoma with BRAF V600E Mutation
2011 · 7.642 Zit.
Final Version of 2009 AJCC Melanoma Staging and Classification
2009 · 4.547 Zit.
Technical Details of Intraoperative Lymphatic Mapping for Early Stage Melanoma
1992 · 4.392 Zit.
Autoren
Institutionen
- Aristotle University of Thessaloniki(GR)
- University Hospital of Larissa(GR)
- Andreas Sygros Hospital(GR)
- Frauenklinik an der Elbe(DE)
- National and Kapodistrian University of Athens(GR)
- Aretaeio Hospital(GR)
- University Hospital of Heraklion(GR)
- University of Thessaly(GR)
- Nicosia General Hospital(CY)
- Development Agency of Karditsa(GR)
- University General Hospital Attikon(GR)
- Nnamdi Azikiwe University Teaching Hospital(NG)
- Université Joseph Ki-Zerbo(BF)