Unmasking Conversational Bias in AI Multiagent Systems
Erica Coppolillo, Giuseppe Manco, Luca Maria Aiello

TL;DR
This paper introduces a framework to detect biases in multi-agent conversational AI systems by simulating echo chambers, revealing biases undetected by existing methods, and emphasizing the need for advanced bias detection tools.
Contribution
The paper presents a novel framework for quantifying biases in multi-agent LLM systems through simulated discussions, uncovering biases missed by current detection methods.
Findings
Echo chambers amplify conservative biases in LLM conversations.
Current bias detection methods fail to identify biases in multi-agent settings.
Biases in multi-agent systems can significantly differ from isolated model outputs.
Abstract
Detecting biases in the outputs produced by generative models is essential to reduce the potential risks associated with their application in critical settings. However, the majority of existing methodologies for identifying biases in generated text consider the models in isolation and neglect their contextual applications. Specifically, the biases that may arise in multi-agent systems involving generative models remain under-researched. To address this gap, we present a framework designed to quantify biases within multi-agent systems of conversational Large Language Models (LLMs). Our approach involves simulating small echo chambers, where pairs of LLMs, initialized with aligned perspectives on a polarizing topic, engage in discussions. Contrary to expectations, we observe significant shifts in the stance expressed in the generated messages, particularly within echo chambers where all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
