Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis
Nikolay B Petrov, Gregory Serapio-Garc\'ia, Jason Rentfrow

TL;DR
This study evaluates the capacity of GPT-3.5 and GPT-4 to simulate human psychological traits through standardized questionnaires, revealing GPT-4's limited success and questioning LLMs' ability to mimic individual human behaviors accurately.
Contribution
The paper applies psychometric analysis to assess LLMs' ability to simulate human psychological profiles, highlighting limitations especially with demographic-specific prompts.
Findings
GPT-4 shows some promising psychometric properties with generic personas.
Responses from both models with demographic profiles show poor psychometric validity.
Current LLMs are inadequate for accurately simulating individual human psychological traits.
Abstract
The humanlike responses of large language models (LLMs) have prompted social scientists to investigate whether LLMs can be used to simulate human participants in experiments, opinion polls and surveys. Of central interest in this line of research has been mapping out the psychological profiles of LLMs by prompting them to respond to standardized questionnaires. The conflicting findings of this research are unsurprising given that mapping out underlying, or latent, traits from LLMs' text responses to questionnaires is no easy task. To address this, we use psychometrics, the science of psychological measurement. In this study, we prompt OpenAI's flagship models, GPT-3.5 and GPT-4, to assume different personas and respond to a range of standardized measures of personality constructs. We used two kinds of persona descriptions: either generic (four or five random person descriptions) or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Position-Wise Feed-Forward Layer · Cosine Annealing · Dropout · Linear Warmup With Cosine Annealing · Label Smoothing · Residual Connection · Absolute Position Encodings
