Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluating GPT-5 for Melanoma Detection Using Dermoscopic Images
1
Zitationen
7
Autoren
2025
Jahr
Abstract
<b>Background</b>: Melanoma is the deadliest form of skin cancer, for which early detection is crucial and can lead to positive survival outcomes. Advances in AI, particularly large language models (LLMs) such as GPT-5, present promising opportunities to support melanoma early detection, but their performance in this domain has not been systematically assessed. <b>Objectives</b>: Assess GPT-5's diagnostic performance on dermoscopic images. <b>Methods</b>: GPT-5 was evaluated on two public benchmark datasets: the ISIC Archive and HAM10K, using 100 and 500 randomly selected dermoscopic images, respectively. Via the OpenAI Application Programming Interface (API), GPT-5 was prompted to perform three tasks: (1) top-1 or primary diagnosis, (2) top-3 differential diagnoses, and (3) malignancy discrimination (melanoma vs. benign). Model outputs were compared with histopathology-verified ground truth, and performance was measured by sensitivity, specificity, accuracy, F1 score, and other metrics. <b>Results</b>: GPT-5 achieved modest accuracy in top-1 or primary diagnosis but markedly improved performance in top-3 differential diagnoses, with sensitivity > 93%, specificity > 86%, accuracy ≥ 92%, and F1 score > 91%. For malignancy discrimination, GPT-5 showed more balanced sensitivity and specificity than GPT-4-based models (GPT-4V, GPT-4T, and GPT-4o), resulting in more reliable classification overall. <b>Conclusions</b>: GPT-5 outperformed GPT-4 and its derivatives, particularly in differential diagnosis, highlighting its potential for clinical decision support and medical education. However, GPT-5 also showed a tendency to misclassify melanoma as benign, underscoring the need for cautious clinical interpretation and refinement.
Ähnliche Arbeiten
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.156 Zit.
Tumor Angiogenesis: Therapeutic Implications
1971 · 10.085 Zit.
Improved Survival with Vemurafenib in Melanoma with BRAF V600E Mutation
2011 · 7.644 Zit.
Final Version of 2009 AJCC Melanoma Staging and Classification
2009 · 4.547 Zit.
Technical Details of Intraoperative Lymphatic Mapping for Early Stage Melanoma
1992 · 4.394 Zit.