Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large Language Models Demonstrate Distinct Personality Profiles
1
Zitationen
2
Autoren
2025
Jahr
Abstract
INTRODUCTION: Large language models (LLMs) are increasingly used in clinical medicine to provide emotional support, deliver cognitive-behavioral therapy, and assist in triage and diagnosis. However, as LLMs are integrated into mental health applications, assessing their personality expression and potential divergence from expected neutrality is critical for ensuring clinical safety and therapeutic appropriateness. This study provides the first psychometric analysis of LLM personality, specifically within a medical context, characterizing personality profiles using two validated frameworks: the Open Extended Jungian Type Scales (OEJTS) and the Big Five Personality Test. METHODS: Four leading LLMs publicly available in April 2024 (ChatGPT-3.5 (OpenAI, San Francisco, CA, USA), Gemini Advanced (Google Inc., Mountain View, CA, USA), Claude 3 Opus (Anthropic, San Francisco, CA, USA), and Grok-Regular Mode (xAI, Palo Alto, CA, USA)) were evaluated across both psychometric instruments. All tests were administered in a new chat session to prevent memory carryover. A one-way multivariate analysis of variance (MANOVA) was performed to assess inter-model differences in personality profiles. RESULTS: MANOVA demonstrated statistically significant differences across models in typological and dimensional personality traits (Wilks' Lambda = 0.115, p < 0.001). OEJTS results showed ChatGPT-3.5 most often classified as Extraverted, Intuitive, Thinking, and Judging (ENTJ) and Claude 3 Opus consistently as Introverted, Intuitive, Thinking, and Judging (INTJ), while Gemini Advanced and Grok-Regular leaned toward Introverted, Intuitive, Feeling, Judging (INFJ). On the Big Five Personality Test, Gemini scored markedly lower on agreeableness and conscientiousness, while Claude scored highest on conscientiousness and emotional stability. Grok-Regular exhibited high openness but more variability in stability. Effect sizes ranged from moderate to large across traits. Conclusion: Distinct personality profiles are consistently expressed across different LLMs, even in unprompted conditions. Given the increasing integration of LLMs into clinical workflows, these findings underscore the need for formal personality evaluation and oversight involving mental health professionals before deployment.
Ähnliche Arbeiten
The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods
2009 · 5.723 Zit.
The Stress Process
1981 · 4.489 Zit.
Mental health problems and social media exposure during COVID-19 outbreak
2020 · 2.796 Zit.
Cross-national prevalence and risk factors for suicidal ideation, plans and attempts
2008 · 2.637 Zit.
Psychological Aspects of Natural Language Use: Our Words, Our Selves
2002 · 2.565 Zit.