Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance

Pedro Henrique Luz de Araujo; Paul R\"ottger; Dirk Hovy; Benjamin Roth

arXiv:2508.19764·cs.CL·December 16, 2025

Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance

Pedro Henrique Luz de Araujo, Paul R\"ottger, Dirk Hovy, Benjamin Roth

PDF

1 Video

TL;DR

This paper investigates how expert persona prompting affects task performance in language models, revealing that while it can improve results, irrelevant details often harm performance, highlighting the need for careful design and evaluation.

Contribution

It provides a comprehensive analysis of persona prompting effects, introduces three key desiderata, and evaluates strategies to enhance robustness across different models and tasks.

Findings

01

Expert personas often improve or do not significantly change performance.

02

Irrelevant persona details can cause performance drops up to 30%.

03

Mitigation strategies are effective only for the most capable models.

Abstract

Expert persona prompting -- assigning roles such as expert in math to language models -- is widely used for task improvement. However, prior work shows mixed results on its effectiveness, and does not consider when and why personas should improve performance. We analyze the literature on persona prompting for task improvement and distill three desiderata: 1) performance advantage of expert personas, 2) robustness to irrelevant persona attributes, and 3) fidelity to persona attributes. We then evaluate 9 state-of-the-art LLMs across 27 tasks with respect to these desiderata. We find that expert personas usually lead to positive or non-significant performance changes. Surprisingly, models are highly sensitive to irrelevant persona details, with performance drops of almost 30 percentage points. In terms of fidelity, we find that while higher education, specialization, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance· underline