From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers
Yi-Fei Liu, Yi-Long Lu, Di He, Hang Zhang

TL;DR
This study shows that large language models can accurately model human psychological trait structures from minimal data, using a systematic reasoning process that involves summarization and abstraction, surpassing simple semantic similarity predictions.
Contribution
The paper introduces a novel approach demonstrating that LLMs can predict psychological profiles with high accuracy by mimicking human reasoning and summarization processes, revealing emergent trait interplay.
Findings
LLMs accurately replicate human inter-trait correlation patterns (R^2 > 0.89)
LLMs' zero-shot performance exceeds semantic similarity predictions
Summarization enhances trait prediction accuracy by capturing emergent patterns
Abstract
Psychological constructs within individuals are widely believed to be interconnected. We investigated whether and how Large Language Models (LLMs) can model the correlational structure of human psychological traits from minimal quantitative inputs. We prompted various LLMs with Big Five Personality Scale responses from 816 human individuals to role-play their responses on nine other psychological scales. LLMs demonstrated remarkable accuracy in capturing human psychological structure, with the inter-scale correlation patterns from LLM-generated responses strongly aligning with those from human data . This zero-shot performance substantially exceeded predictions based on semantic similarity and approached the accuracy of machine learning algorithms trained directly on the dataset. Analysis of reasoning traces revealed that LLMs use a systematic two-stage process: First,…
Peer Reviews
Decision·ICLR 2026 Poster
The authors suggest a great research question, find an answer to it and demonstrate the validity of the answer in a convincing way.
I am not an expert in classical psychology literature, but some of the human-score correlations should be taken with a grain of salt due to the fact that they are often reported on small selection of subjects that are not representing the scale of human experience that an LLM is exposed to. In this context the stronger correlation that the authors report is even more surprizing to me, especially in the context of the later remark on more "attentive" subjects.
- Moving from the first‑order prediction to the second‑order correlation‑structure analysis is an interesting reframing of the psychology questions with LLMs. The heatmaps and regression plots in Figure 2 (page 5) compellingly visualize the phenomenon. - An attempt to mechanism‑seeking analysis: factor‑level and item‑level attribution comparison in Figure 4, and amplification depending on information type in Figure 5 are good steps toward understanding the model's decisions rather than reportin
- Related Work (Section 2) significantly lacks depth to situate the paper within existing literature. In particular, it overlooks a growing body of research suggesting that LLMs exhibit latent psychological structure of humans. Few papers I found by search are “Personallm: Investigating the ability of gpt-3.5 to express personality traits and gender differences” (Jiang et al.), “Personality traits in large language models” (Serapio-Garcia et al.), “Rediscovering the Latent Dimensions of Persona
The paper is well-argued and well-structured. By comparing scores across a range of psychometric tests, the experimental design address a fundamental gap in the literature, namely with regard to the coherence and stability of LLM personalities. Further, the experimental setup allows for comparisons across models and across learning algorithms, which strengthens the findings. The paper offers an insightful analysis and discussion of LLMs tendency towards idealized representations.
The correlational analysis of Experiment 1 presented in 3.2 is interesting from an exploratory perspective but it is not very convincing and slightly problematic: Multiple comparisons are discouraged and should be replaced with more robust multivariate tests. Indeed, experiment 2 and the subsequent analysis in 4.2 is much more conclusive.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Personality Traits and Psychology · Mental Health Research Topics
