Judging with Personality and Confidence: A Study on Personality-Conditioned LLM Relevance Assessment

Nuo Chen; Hanpei Fang; Piaohong Wang; Jiqun Liu; Tetsuya Sakai; Xiao-Ming Wu

arXiv:2601.01862·cs.CL·January 6, 2026

Judging with Personality and Confidence: A Study on Personality-Conditioned LLM Relevance Assessment

Nuo Chen, Hanpei Fang, Piaohong Wang, Jiqun Liu, Tetsuya Sakai, Xiao-Ming Wu

PDF

Open Access

TL;DR

This study explores how simulating different personality traits in large language models affects their relevance judgments and confidence calibration in web search tasks, leading to improved evaluation methods.

Contribution

It introduces a systematic analysis of personality-conditioned LLM relevance assessment and demonstrates enhanced evaluation accuracy using personality-derived confidence features.

Findings

01

Low agreeableness aligns better with human labels.

02

Low conscientiousness balances overconfidence and underconfidence.

03

Personality-conditioned scores improve relevance assessment performance.

Abstract

Recent studies have shown that prompting can enable large language models (LLMs) to simulate specific personality traits and produce behaviors that align with those traits. However, there is limited understanding of how these simulated personalities influence critical web search decisions, specifically relevance assessment. Moreover, few studies have examined how simulated personalities impact confidence calibration, specifically the tendencies toward overconfidence or underconfidence. This gap exists even though psychological literature suggests these biases are trait-specific, often linking high extraversion to overconfidence and high neuroticism to underconfidence. To address this gap, we conducted a comprehensive study evaluating multiple LLMs, including commercial models and open-source models, prompted to simulate Big Five personality traits. We tested these models across three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Personality Traits and Psychology · Topic Modeling