Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Comparison of effectiveness between ChatGPT 3.5 and 4 in understanding different natural languages
0
Zitationen
4
Autoren
2025
Jahr
Abstract
This paper addresses the multilingual language understanding of ChatGPT‒3.5 and 4 to investigate their performance with respect to languages with different degrees of prevalence on the internet. ChatGPT’s training data mostly consists of website content. As the language distribution is unevenly allocated and a low number of languages is used on websites this should impact performance. Both ChatGPT versions should rate reviews between 1 to 5 stars based solely on the product description and the review texts. Therefore, 500 e‒commerce reviews are collected for each of five languages: English, German, Dutch, Korean and Hindi, which are evenly distributed at 100 reviews per star rating. The evaluation methods and metrics used in this study include t‒tests, confusion matrices, macro F1 values and a defined cumulative star deviation. The results indicate a significant correlation between the degree of dissemination and the accuracy of the ChatGPT‒3.5 evaluation. In direct comparison, ChatGPT‒4 shows superior accuracy in all languages studied, while maintaining acceptable performance in less represented languages. The hypothesis that ChatGPT‒4 scoring accuracy increases with an increase in the number of words in reviews in less represented languages could not be confirmed. These findings illustrate the influence of the selected language on the interaction with ChatGPT and its language comprehension, which suggests that multilingualism should be given greater consideration in the future development and optimization of large language models.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.102 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.468 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.