Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Analyzing why AI struggles with drawing human hands with CLIP
1
Zitationen
6
Autoren
2025
Jahr
Abstract
<ns3:p>Background Artificial Intelligence (AI) has made significant strides in various domains, but generating realistic human hands remains a challenge. This study explores the limitations of AI in capturing the fine details and proportions of hands, using Contrastive Language Image Pretraining (CLIP) as a case study. Methods Our analysis reveals that CLIP struggles to accurately represent hands due to inadequate training data, anatomical complexities, and practical challenges. We conducted a series of tests and analyses to identify the primary causes of CLIP’s difficulties. Results Our results show that CLIP’s struggles stem from data biases and insufficient anatomical representation in training datasets. Specifically, we found distorted finger relationships, inaccurate proportions, and deviations from expected hand geometry. Conclusion This study aims to provide a comprehensive examination of the current limitations and propose possible directions for future research. By leveraging CLIP for evaluation, control algorithms for structure enforcement, DALL-E for generation, AR for gesture tracking, and 3D modeling for anatomical accuracy, we can overcome the challenges of generating realistic human hands and advance AI’s capabilities in artistic creativity</ns3:p>
Ähnliche Arbeiten
MizAR 60 for Mizar 50
2023 · 74.131 Zit.
ImageNet: A large-scale hierarchical image database
2009 · 60.466 Zit.
Microsoft COCO: Common Objects in Context
2014 · 41.113 Zit.
Fully convolutional networks for semantic segmentation
2015 · 36.288 Zit.
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
2017 · 20.310 Zit.