From Single to Societal: Analyzing Persona-Induced Bias in Multi-Agent Interactions
Jiayi Li, Xiao Liu, Yansong Feng

TL;DR
This paper systematically investigates how assigning personas to LLM-based multi-agent systems introduces biases in social traits like trustworthiness and insistence, revealing persistent in-group favoritism and biases across various settings.
Contribution
It uncovers and analyzes the presence of social biases induced by personas in multi-agent interactions, emphasizing the need for bias mitigation in such systems.
Findings
Agents with historically advantaged personas are perceived as less trustworthy.
Agents show in-group favoritism, conforming more to shared personas.
Biases persist across different LLMs, group sizes, and interaction rounds.
Abstract
Large Language Model (LLM)-based multi-agent systems are increasingly used to simulate human interactions and solve collaborative tasks. A common practice is to assign agents with personas to encourage behavioral diversity. However, this raises a critical yet underexplored question: do personas introduce biases into multi-agent interactions? This paper presents a systematic investigation into persona-induced biases in multi-agent interactions, with a focus on social traits like trustworthiness (how an agent's opinion is received by others) and insistence (how strongly an agent advocates for its opinion). Through a series of controlled experiments in collaborative problem-solving and persuasion tasks, we reveal that (1) LLM-based agents exhibit biases in both trustworthiness and insistence, with personas from historically advantaged groups (e.g., men and White individuals) perceived as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPersona Design and Applications · AI in Service Interactions · Social Robot Interaction and HRI
