From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers

Yi-Fei Liu; Yi-Long Lu; Di He; Hang Zhang

arXiv:2511.03235·cs.AI·March 24, 2026

From Five Dimensions to Many: Large Language Models as Precise and Interpretable Psychological Profilers

Yi-Fei Liu, Yi-Long Lu, Di He, Hang Zhang

PDF

Open Access 3 Reviews

TL;DR

This study shows that large language models can accurately model human psychological trait structures from minimal data, using a systematic reasoning process that involves summarization and abstraction, surpassing simple semantic similarity predictions.

Contribution

The paper introduces a novel approach demonstrating that LLMs can predict psychological profiles with high accuracy by mimicking human reasoning and summarization processes, revealing emergent trait interplay.

Findings

01

LLMs accurately replicate human inter-trait correlation patterns (R^2 > 0.89)

02

LLMs' zero-shot performance exceeds semantic similarity predictions

03

Summarization enhances trait prediction accuracy by capturing emergent patterns

Abstract

Psychological constructs within individuals are widely believed to be interconnected. We investigated whether and how Large Language Models (LLMs) can model the correlational structure of human psychological traits from minimal quantitative inputs. We prompted various LLMs with Big Five Personality Scale responses from 816 human individuals to role-play their responses on nine other psychological scales. LLMs demonstrated remarkable accuracy in capturing human psychological structure, with the inter-scale correlation patterns from LLM-generated responses strongly aligning with those from human data $(R^{2} > 0.89)$ . This zero-shot performance substantially exceeded predictions based on semantic similarity and approached the accuracy of machine learning algorithms trained directly on the dataset. Analysis of reasoning traces revealed that LLMs use a systematic two-stage process: First,…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 8Confidence 4

Strengths

The authors suggest a great research question, find an answer to it and demonstrate the validity of the answer in a convincing way.

Weaknesses

I am not an expert in classical psychology literature, but some of the human-score correlations should be taken with a grain of salt due to the fact that they are often reported on small selection of subjects that are not representing the scale of human experience that an LLM is exposed to. In this context the stronger correlation that the authors report is even more surprizing to me, especially in the context of the later remark on more "attentive" subjects.

Reviewer 02Rating 4Confidence 2

Strengths

- Moving from the first‑order prediction to the second‑order correlation‑structure analysis is an interesting reframing of the psychology questions with LLMs. The heatmaps and regression plots in Figure 2 (page 5) compellingly visualize the phenomenon. - An attempt to mechanism‑seeking analysis: factor‑level and item‑level attribution comparison in Figure 4, and amplification depending on information type in Figure 5 are good steps toward understanding the model's decisions rather than reportin

Weaknesses

- Related Work (Section 2) significantly lacks depth to situate the paper within existing literature. In particular, it overlooks a growing body of research suggesting that LLMs exhibit latent psychological structure of humans. Few papers I found by search are “Personallm: Investigating the ability of gpt-3.5 to express personality traits and gender differences” (Jiang et al.), “Personality traits in large language models” (Serapio-Garcia et al.), “Rediscovering the Latent Dimensions of Persona

Reviewer 03Rating 8Confidence 4

Strengths

The paper is well-argued and well-structured. By comparing scores across a range of psychometric tests, the experimental design address a fundamental gap in the literature, namely with regard to the coherence and stability of LLM personalities. Further, the experimental setup allows for comparisons across models and across learning algorithms, which strengthens the findings. The paper offers an insightful analysis and discussion of LLMs tendency towards idealized representations.

Weaknesses

The correlational analysis of Experiment 1 presented in 3.2 is interesting from an exploratory perspective but it is not very convincing and slightly problematic: Multiple comparisons are discouraged and should be replaced with more robust multivariate tests. Indeed, experiment 2 and the subsequent analysis in 4.2 is much more conclusive.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Personality Traits and Psychology · Mental Health Research Topics