PTCBENCH: Benchmarking Contextual Stability of Personality Traits in LLM Systems

Jiongchi Yu; Yuhan Ma; Xiaoyu Zhang; Junjie Wang; Qiang Hu; Chao Shen; Xiaofei Xie

arXiv:2602.00016·cs.CL·February 3, 2026

PTCBENCH: Benchmarking Contextual Stability of Personality Traits in LLM Systems

Jiongchi Yu, Yuhan Ma, Xiaoyu Zhang, Junjie Wang, Qiang Hu, Chao Shen, Xiaofei Xie

PDF

Open Access

TL;DR

This paper introduces PTCBENCH, a benchmark for evaluating how consistent large language model personalities remain across various contexts, highlighting significant personality shifts triggered by external scenarios.

Contribution

It presents a systematic framework for measuring LLM personality stability under diverse situational conditions, addressing a gap in existing research.

Findings

01

Certain external scenarios cause significant personality changes in LLMs

02

Personality shifts can impact LLM reasoning capabilities

03

PTCBENCH provides an extensible framework for realistic environment evaluation

Abstract

With the increasing deployment of large language models (LLMs) in affective agents and AI systems, maintaining a consistent and authentic LLM personality becomes critical for user trust and engagement. However, existing work overlooks a fundamental psychological consensus that personality traits are dynamic and context-dependent. To bridge this gap, we introduce PTCBENCH, a systematic benchmark designed to quantify the consistency of LLM personalities under controlled situational contexts. PTCBENCH subjects models to 12 distinct external conditions spanning diverse location contexts and life events, and rigorously assesses the personality using the NEO Five-Factor Inventory. Our study on 39,240 personality trait records reveals that certain external scenarios (e.g., "Unemployment") can trigger significant personality changes of LLMs, and even alter their reasoning capabilities. Overall,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersonality Traits and Psychology · Mental Health via Writing · Digital Mental Health Interventions