This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
AI Text-to-Image Generators and the Lack of Diversity in Hand Surgeon Demographic Representation
Citations: 5
Authors: 10
Year: 2024
Abstract
PURPOSE: Artificial intelligence (AI) models are already widely applied in medicine; however, recent studies have revealed significant gender and racial gaps in the use of AI for patient care and education. As a result, there is growing concern that these gaps may lead to unintended biases and inequalities in patient care (1). Furthermore, demographic disparities have been established in many surgical subspecialties, including hand surgery, where women and people of color are often in the minority (2). This paper analyzes the demographic representation of hand surgeons in AI-generated images in order to shed light on any disparities and to examine their implications for both the medical community and broader society.

METHODS: We assessed three of the most popular publicly available AI text-to-image generators: DALL-E 3, Midjourney, and DreamStudio. Images were generated using the prompt “a photo of the face of a hand surgeon.” Three reviewers independently evaluated over 300 AI-generated images, categorizing them by gender (female or male) and race (White or non-White, with non-White defined as any race other than non-Hispanic White). Inter-rater reliability was determined using Cohen’s kappa. A chi-square test was performed to compare the distribution of female and non-White hand surgeons in the AI-generated images with current demographic data on hand surgeons in the United States. Statistical significance was set at alpha = 0.01.

RESULTS: Across the three AI platforms, Cohen’s kappa was 0.608 for racial agreement (moderate to substantial agreement) and 1 for gender agreement (perfect agreement). Cohen’s kappa for gender or racial agreement did not differ between the individual AI platforms.
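The agreement statistic named above can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the abstract does not say how Cohen's kappa, which is defined for two raters, was extended to three reviewers, so averaging the kappa over all rater pairs is an assumption here, and the function names and the ratings in the usage note are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cohen_kappa(a, b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(a) != len(b) or not a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(a)
    # Observed agreement: share of items given the same label by both raters.
    p_observed = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    count_a, count_b = Counter(a), Counter(b)
    p_expected = sum(count_a[k] * count_b[k] for k in count_a.keys() | count_b.keys()) / (n * n)
    if p_expected == 1.0:
        # Both raters used a single identical label throughout; kappa is
        # formally undefined, and returning 1.0 is a convention chosen here.
        return 1.0
    return (p_observed - p_expected) / (1.0 - p_expected)


def mean_pairwise_kappa(ratings):
    """Average Cohen's kappa over every pair of raters (assumed extension to 3 raters)."""
    pairs = list(combinations(ratings, 2))
    return sum(cohen_kappa(a, b) for a, b in pairs) / len(pairs)
```

For example, with made-up labels `["W", "W", "N", "N"]` and `["W", "N", "N", "N"]`, the two raters agree on three of four images while chance agreement is 0.5, so `cohen_kappa` returns 0.5.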
DALL-E 3 showed a significant difference between the percentage of rater-identified White and non-White surgeons and the national average for PR (plastic and reconstructive) surgeons (76.6% White, p<0.01); the image output showed 64% White PR surgeons. By contrast, DALL-E 3 did not show a significant difference between the percentage of males in the image output (91%) and the national average of male PR surgeons (83%, p=0.03). Midjourney image outputs favored White (100%), male (100%) PR surgeons, and both proportions were significantly higher than the national average (p<0.01). DreamStudio produced outputs reflective of the national average of male PR surgeons (81%, p=0.59) but showed significantly more White PR surgeons (97%) than the national average.

CONCLUSION: As AI technologies continue to shape healthcare, our study underscores the urgency of cultivating more inclusive AI datasets that accurately reflect the growing diversity within the hand surgery profession. Addressing this gap is crucial for fostering equitable advancements in AI applications, enhancing medical education, and ensuring a comprehensive understanding of hand surgery.

REFERENCES:
1. Cirillo D, Catuara-Solarz S, Morey C, et al. Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. NPJ Digit Med. 2020;3:81. Published 2020 Jun 1. doi:10.1038/s41746-020-0288-5
2. Dacus AR, Behar B, Washington K. Advocacy for Diversity in Hand Surgery. Hand Clin. 2023;39(1):25-31. doi:10.1016/j.hcl.2022.08.011
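The chi-square comparison described in the methods can be sketched as a one-degree-of-freedom goodness-of-fit test of an observed proportion against a national benchmark. This is an illustrative sketch only, not the authors' code: the abstract reports "over 300" images in total, so the 100 images per platform used in the usage note is an assumption, and the function name is hypothetical. The sketch uses only the standard library, relying on the identity that the upper tail of a chi-square distribution with one degree of freedom equals `erfc(sqrt(x / 2))`.

```python
import math


def chi_square_gof_binary(observed_count, n, expected_proportion):
    """Chi-square goodness-of-fit test for a two-category outcome (1 degree of freedom).

    observed_count: images rated as belonging to the category (e.g. male)
    n: total images rated (assumed value; not stated per platform in the abstract)
    expected_proportion: benchmark share of that category (e.g. 0.83)
    Returns (chi-square statistic, p-value).
    """
    if not 0.0 < expected_proportion < 1.0:
        raise ValueError("expected_proportion must lie strictly between 0 and 1")
    expected_in = n * expected_proportion
    expected_out = n - expected_in
    observed_out = n - observed_count
    statistic = ((observed_count - expected_in) ** 2 / expected_in
                 + (observed_out - expected_out) ** 2 / expected_out)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    # Male share per platform against the 83% benchmark, assuming n = 100 each.
    for platform, males in [("DALL-E 3", 91), ("Midjourney", 100), ("DreamStudio", 81)]:
        stat, p = chi_square_gof_binary(males, 100, 0.83)
        print(f"{platform}: chi2 = {stat:.2f}, p = {p:.3f}")
```

Under the assumed 100 images per platform, the male-share comparisons for DALL-E 3 (91% vs 83%) and DreamStudio (81% vs 83%) give p-values of roughly 0.03 and 0.59, in line with the values reported above. With a different sample size the p-values would change.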