This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Benchmarking AI acceptability and grammaticality in German: A study of ChatGPT and human judgments
Citations: 0
Authors: 1
Year: 2026
Abstract
The rapid development of large language models has opened new avenues for linguistic research, including areas traditionally reliant on native-speaker intuitions. One such domain is grammaticality and acceptability judgment, where speakers assess whether sentences are structurally well-formed and contextually appropriate. This study investigates the extent to which ChatGPT-4 can approximate human judgments in German, focusing on a diverse range of grammatical and usage-related phenomena. A carefully designed set of test items was presented to both the model and native speakers, allowing for a direct comparison. The results show a high degree of alignment in many cases, but also reveal systematic divergences, particularly in contexts involving gradience, sociolinguistic markedness or context-dependent acceptability. These findings demonstrate both the analytical potential and the current limitations of large language models in linguistic research, and contribute to ongoing discussions about their ability to approximate native speaker competence.
Related works
The cortical organization of speech processing
2007 · 5,454 citations
Working Memory
1992 · 5,107 citations
A theory of lexical access in speech production [target paper]
1999 · 5,062 citations
Toward a model of text comprehension and production.
1978 · 4,999 citations
Classification of primary progressive aphasia and its variants
2011 · 4,994 citations