Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Assessing GPT-4’s diagnostic accuracy with darker skin tones: underperformance and implications
4
Zitationen
7
Autoren
2024
Jahr
Abstract
This study aimed to assess the accuracy of GPT-4 in generating appropriate differential diagnoses and arriving at the correct diagnoses for common skin lesions. Additionally, we investigated any differences in its diagnostic accuracy between darker and lighter skin tones. GPT-4 exhibited better performance in providing the correct diagnosis for lighter skin tones (44%, n = 11 of 25) compared with darker skin tones (12%, n = 3 of 25), and this was statistically significant (P < 0.05). Furthermore, with each unit increase in the Fitzpatrick scale, GPT-4’s performance decreased by 11.4% in accurately providing a differential diagnosis and by 7.1% in accurately providing the correct diagnosis.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.239 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.095 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.463 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.428 Zit.