Fairness Testing of Large Language Models in Role-Playing

Xinyue Li; Zhenpeng Chen; Jie M. Zhang; Ying Xiao; Tianlin Li; Weisong Sun; Yang Liu; Yiling Lou; Xuanzhe Liu

arXiv:2411.00585·cs.CY·April 23, 2026·5 cites

Fairness Testing of Large Language Models in Role-Playing

Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Ying Xiao, Tianlin Li, Weisong Sun, Yang Liu, Yiling Lou, Xuanzhe Liu

PDF

TL;DR

This study empirically tests fairness in large language models during role-playing by generating diverse social roles and questions, revealing widespread biases across multiple models.

Contribution

It introduces a comprehensive methodology for fairness testing in role-playing scenarios and publicly releases a dataset and tools for future research.

Findings

01

Identified 107,580 biased responses across 10 LLMs.

02

Models produced between 7,579 and 16,963 biased responses each.

03

Generated 33,000 role-specific questions targeting bias.

Abstract

Large Language Models (LLMs) have become foundational in modern language-driven software applications, profoundly influencing daily life. A critical technique in leveraging their potential is role-playing, where LLMs simulate diverse roles to enhance their real-world utility. However, while research has highlighted the presence of social biases in LLM outputs, it remains unclear whether and to what extent these biases emerge during role-playing scenarios. In this paper, we conduct an empirical study on fairness testing of LLMs in role-playing scenarios. To enable this testing, we use LLMs to generate 550 social roles spanning a comprehensive set of 11 demographic attributes, producing 33,000 role-specific questions that target various forms of bias. These questions, covering Yes/No, multiple-choice, and open-ended formats, are designed to prompt LLMs to adopt specific roles and respond…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.