Cognitive phantoms in LLMs through the lens of latent variables

Sanne Peereboom; Inga Schwabe; Bennett Kleinberg

arXiv:2409.15324·cs.AI·May 13, 2025

Cognitive phantoms in LLMs through the lens of latent variables

Sanne Peereboom, Inga Schwabe, Bennett Kleinberg

PDF

Open Access

TL;DR

This paper critically examines whether human psychometric questionnaires validly measure personality traits in large language models, revealing that such constructs may not exist in LLMs and emphasizing the need for proper psychometric analysis.

Contribution

It demonstrates that human-designed personality questionnaires do not validly assess LLMs, highlighting the importance of psychometric validation in machine behavior studies.

Findings

01

Questionnaires do not measure similar constructs in LLMs as in humans.

02

Constructs may not exist in LLMs at all.

03

Highlights the need for psychometric validation of LLM assessments.

Abstract

Large language models (LLMs) increasingly reach real-world applications, necessitating a better understanding of their behaviour. Their size and complexity complicate traditional assessment methods, causing the emergence of alternative approaches inspired by the field of psychology. Recent studies administering psychometric questionnaires to LLMs report human-like traits in LLMs, potentially influencing LLM behaviour. However, this approach suffers from a validity problem: it presupposes that these traits exist in LLMs and that they are measurable with tools designed for humans. Typical procedures rarely acknowledge the validity problem in LLMs, comparing and interpreting average LLM scores. This study investigates this problem by comparing latent structures of personality between humans and three LLMs using two validated personality questionnaires. Findings suggest that questionnaires…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management