The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models

Zhixiang Wang

arXiv:2512.07092·cs.LG·December 9, 2025

The Geometry of Persona: Disentangling Personality from Reasoning in Large Language Models

Zhixiang Wang

PDF

Open Access

TL;DR

This paper introduces a geometric framework for disentangling personality traits from reasoning in large language models, enabling precise, controllable personalization without fine-tuning.

Contribution

It proposes the Soul Engine framework based on the Linear Representation Hypothesis, allowing for orthogonal personality vectors extraction without modifying model weights.

Findings

01

Achieved high-precision personality profiling with low MSE

02

Confirmed distinct and continuous personality manifolds via visualization

03

Enabled robust, deterministic control over model behavior

Abstract

Background: The deployment of personalized Large Language Models (LLMs) is currently constrained by the stability-plasticity dilemma. Prevailing alignment methods, such as Supervised Fine-Tuning (SFT), rely on stochastic weight updates that often incur an "alignment tax" -- degrading general reasoning capabilities. Methods: We propose the Soul Engine, a framework based on the Linear Representation Hypothesis, which posits that personality traits exist as orthogonal linear subspaces. We introduce SoulBench, a dataset constructed via dynamic contextual sampling. Using a dual-head architecture on a frozen Qwen-2.5 base, we extract disentangled personality vectors without modifying the backbone weights. Results: Our experiments demonstrate three breakthroughs. First, High-Precision Profiling: The model achieves a Mean Squared Error (MSE) of 0.011 against psychological ground truth.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersona Design and Applications · Personality Traits and Psychology · Mental Health via Writing