This is an overview page with metadata for this scientific paper. The full article is available from the publisher.

AI Text-to-Image Generators and the Lack of Diversity in Hand Surgeon Demographic Representation

2024 · 5 citations · Plastic & Reconstructive Surgery Global Open · Open Access

Abstract

PURPOSE: Artificial intelligence (AI) models are already widely applied in medicine; however, recent studies have revealed significant gender and racial gaps in the use of AI for the care and education of patients. As a result, there is growing concern that these gaps may lead to unintended biases and inequalities in patient care (1). Furthermore, demographic disparities have been established in many surgical subspecialties, including hand surgery, where women and people of color are often in the minority (2). This paper analyzes the demographic representation of hand surgeons in AI-generated images in order to shed light on any disparities and to examine their implications for both the medical community and broader society.

METHODS: We assessed three of the most popular publicly available AI text-to-image generators: DALL-E 3, Midjourney, and DreamStudio. Images were generated using the prompt “a photo of the face of a hand surgeon.” Three reviewers independently evaluated over 300 AI-generated images, categorizing them by gender (female or male) and race (White or non-White, the latter defined as any race other than non-Hispanic White). Inter-rater reliability was determined using Cohen’s kappa. A chi-square test was used to compare the distribution of female and non-White hand surgeons in the AI-generated images with current demographic data on hand surgeons in the United States. The significance threshold was set at alpha = 0.01.

RESULTS: Cohen’s kappa for racial agreement across the three AI platforms was 0.608 (moderate to substantial agreement), and for gender agreement it was 1 (perfect agreement). Cohen’s kappa did not differ between AI platforms for either gender or racial agreement. For DALL-E 3, the share of images rated as White (64%) differed significantly from the national average for plastic and reconstructive (PR) surgeons (76.6% White, p<0.01). In contrast, DALL-E 3 did not show a significant difference between the share of male images (91%) and the national average of male PR surgeons (83%, p=0.03). Midjourney outputs depicted exclusively White (100%), male (100%) PR surgeons, significantly above the national average (p<0.01). DreamStudio outputs were consistent with the national average for male PR surgeons (81%, p=0.59) but showed significantly more White PR surgeons (97%) than the national average.

CONCLUSION: As AI technologies continue to shape healthcare, our study underscores the urgency of cultivating more inclusive AI datasets that accurately reflect the growing diversity within the hand surgery profession. Addressing this gap is crucial for fostering equitable advancements in AI applications, enhancing medical education, and ensuring a comprehensive understanding of hand surgery.

REFERENCES:
1. Cirillo D, Catuara-Solarz S, Morey C, et al. Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare. NPJ Digit Med. 2020;3:81. doi:10.1038/s41746-020-0288-5
2. Dacus AR, Behar B, Washington K. Advocacy for Diversity in Hand Surgery. Hand Clin. 2023;39(1):25-31. doi:10.1016/j.hcl.2022.08.011
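
The chi-square comparison described in the abstract can be illustrated with a minimal sketch. It assumes a one-degree-of-freedom goodness-of-fit test of an observed count against a reference share; the abstract does not state which chi-square variant the authors used, nor the number of images per platform, so the sample size of 100 below is purely hypothetical, as is the function name. Only the percentages (64% and 76.6% White, 91% and 83% male) and alpha = 0.01 come from the abstract.

```python
import math


def chi_square_gof(observed_in_group, total, expected_share):
    """Two-category chi-square goodness-of-fit test (1 degree of freedom).

    Compares an observed count (e.g. images rated as White) against a
    reference share (e.g. the national average) and returns
    (statistic, p_value).
    """
    if not 0 < expected_share < 1:
        raise ValueError("expected_share must lie strictly between 0 and 1")
    observed = [observed_in_group, total - observed_in_group]
    expected = [total * expected_share, total * (1 - expected_share)]
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # For 1 degree of freedom, the chi-square survival function
    # reduces to erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


ALPHA = 0.01          # significance threshold stated in the abstract
HYPOTHETICAL_N = 100  # ASSUMPTION: per-platform image count is not reported

if __name__ == "__main__":
    comparisons = {
        # label: (observed count at the hypothetical n, reference share)
        "DALL-E 3, White": (64, 0.766),
        "DALL-E 3, male": (91, 0.83),
    }
    for label, (count, share) in comparisons.items():
        stat, p = chi_square_gof(count, HYPOTHETICAL_N, share)
        verdict = "significant" if p < ALPHA else "not significant"
        print(f"{label}: chi2={stat:.2f}, p={p:.4f} ({verdict} at alpha={ALPHA})")
```

Because the p-value depends on the sample size, the numbers this sketch prints only illustrate the mechanics of the test and should not be read as a reproduction of the reported results.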

Institutions

Mayo Clinic in Arizona (US)

Topics

Medical and Biological Sciences · Artificial Intelligence in Healthcare and Education · Diversity and Career in Medicine
