Limited Ability of LLMs to Simulate Human Psychological Behaviours: a   Psychometric Analysis

Nikolay B Petrov; Gregory Serapio-Garc\'ia; Jason Rentfrow

arXiv:2405.07248·cs.CL·May 14, 2024·6 cites

Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Nikolay B Petrov, Gregory Serapio-Garc\'ia, Jason Rentfrow

PDF

Open Access 1 Repo

TL;DR

This study evaluates the capacity of GPT-3.5 and GPT-4 to simulate human psychological traits through standardized questionnaires, revealing GPT-4's limited success and questioning LLMs' ability to mimic individual human behaviors accurately.

Contribution

The paper applies psychometric analysis to assess LLMs' ability to simulate human psychological profiles, highlighting limitations especially with demographic-specific prompts.

Findings

01

GPT-4 shows some promising psychometric properties with generic personas.

02

Responses from both models with demographic profiles show poor psychometric validity.

03

Current LLMs are inadequate for accurately simulating individual human psychological traits.

Abstract

The humanlike responses of large language models (LLMs) have prompted social scientists to investigate whether LLMs can be used to simulate human participants in experiments, opinion polls and surveys. Of central interest in this line of research has been mapping out the psychological profiles of LLMs by prompting them to respond to standardized questionnaires. The conflicting findings of this research are unsurprising given that mapping out underlying, or latent, traits from LLMs' text responses to questionnaires is no easy task. To address this, we use psychometrics, the science of psychological measurement. In this study, we prompt OpenAI's flagship models, GPT-3.5 and GPT-4, to assume different personas and respond to a range of standardized measures of personality constructs. We used two kinds of persona descriptions: either generic (four or five random person descriptions) or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nikbpetrov/llms-simulate-humans
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Position-Wise Feed-Forward Layer · Cosine Annealing · Dropout · Linear Warmup With Cosine Annealing · Label Smoothing · Residual Connection · Absolute Position Encodings