LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Maojia Song; Tej Deep Pala; Ruiwen Zhou; Weisheng Jin; Amir Zadeh; Chuan Li; Dorien Herremans; Soujanya Poria

arXiv:2508.18321·cs.CL·December 10, 2025

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Maojia Song, Tej Deep Pala, Ruiwen Zhou, Weisheng Jin, Amir Zadeh, Chuan Li, Dorien Herremans, Soujanya Poria

PDF

1 Datasets

TL;DR

This paper introduces KAIROS, a benchmark for analyzing how large language models interact with peers in social settings, revealing that larger models are more resilient to social influence and that training methods can improve robustness.

Contribution

The paper presents KAIROS, a novel benchmark for studying LLMs in multi-agent social interactions, and systematically evaluates how model size and training influence susceptibility to peer influence.

Findings

01

Larger models are more resistant to social influence.

02

Prompting helps mitigate peer pressure effects in larger models.

03

GRPO training improves robustness in small models.

Abstract

Large language models (LLMs) are increasingly integrated into multi-agent systems (MAS), where peer interactions shape individual decisions. While prior work has mainly examined conformity bias, we broaden the view to include how LLMs build rapport from prior interactions, discern and integrate high-quality peer information, and resist misleading inputs-abilities essential for achieving collective intelligence under complex social dynamics. We introduce KAIROS, a benchmark that simulates quiz-style collaboration with peer agents whose rapport levels and behaviours can be precisely controlled in both historical interactions and the current round. This unified setup enables systematic analysis of how rapport, peer actions, and the model's self-confidence jointly influence decision-making. Using KAIROS, we evaluate prompting, supervised fine-tuning, and reinforcement learning via Group…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

declare-lab/KAIROS_EVAL
dataset· 191 dl
191 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.