Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
The Rise and Impact of Modern Generative AI Tools: A Comparative Study of Chatgpt, Gemini, Claude
0
Zitationen
2
Autoren
2026
Jahr
Abstract
The present research utilizes three popular multimodal AI systems (Gemini, Cloud AI, ChatGPT) to evaluate their ability to interpret visual language by analyzing responses to three different images. In addition to evaluating the systems' ability to produce accurate and detailed descriptions of each image, this research evaluated their ability to contextualize each description within an appropriate framework of understanding, as well as assess their response times. Results indicate that ChatGPT provided the most accurate and descriptive descriptions of the images analyzed in this study, particularly those which depicted emotionally and/or socially nuanced scenes; that Gemini performed reasonably well in terms of conceptual interpretation, though was inconsistent in its provision of specific details regarding the image(s); and that while Cloud AI responded more quickly than either ChatGPT or Gemini, it failed to provide as much detail or relevance to the situation presented in the images. These findings emphasize the need to develop multimodal AI systems that balance speed, emotional intelligence, and semantic accuracy to be used in the real world when reasoning with images.
Ähnliche Arbeiten
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
2017 · 20.326 Zit.
Generative Adversarial Nets
2023 · 19.841 Zit.
Visualizing and Understanding Convolutional Networks
2014 · 15.241 Zit.
"Why Should I Trust You?"
2016 · 14.218 Zit.
On a Method to Measure Supervised Multiclass Model’s Interpretability: Application to Degradation Diagnosis (Short Paper)
2024 · 13.111 Zit.