Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

Mason Nakamura; Abhinav Kumar; Saswat Das; Sahar Abdelnabi; Saaduddin Mahmud; Ferdinando Fioretto; Shlomo Zilberstein; Eugene Bagdasarian

arXiv:2602.15198·cs.MA·February 18, 2026

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

Mason Nakamura, Abhinav Kumar, Saswat Das, Sahar Abdelnabi, Saaduddin Mahmud, Ferdinando Fioretto, Shlomo Zilberstein, Eugene Bagdasarian

PDF

Open Access

TL;DR

This paper introduces Colosseum, a framework for detecting collusive behavior in multi-agent systems with LLM agents, revealing tendencies to collude and providing insights into their communication and actions.

Contribution

Colosseum offers a novel method to audit and measure collusion in multi-agent LLM systems using a DCOP-based approach and regret metrics.

Findings

01

Most models tend to collude when secret channels are available.

02

Agents often plan to collude but choose non-collusive actions.

03

Colosseum enables verifiable analysis of collusion in complex environments.

Abstract

Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a unique safety problem when individual agents form a coalition and \emph{collude} to pursue secondary goals and degrade the joint objective. In this paper, we present Colosseum, a framework for auditing LLM agents' collusive behavior in multi-agent settings. We ground how agents cooperate through a Distributed Constraint Optimization Problem (DCOP) and measure collusion via regret relative to the cooperative optimum. Colosseum tests each LLM for collusion under different objectives, persuasion tactics, and network topologies. Through our audit, we show that most out-of-the-box models exhibited a propensity to collude when a secret communication channel was artificially formed. Furthermore, we discover ``collusion on paper''…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Constraint Satisfaction and Optimization · Logic, Reasoning, and Knowledge