Multi-Agent Teams Hold Experts Back

Aneesh Pappu; Batu El; Hancheng Cao; Carmelo di Nolfo; Yanchao Sun; Meng Cao; James Zou

arXiv:2602.01011·cs.MA·February 10, 2026·2 cites

Multi-Agent Teams Hold Experts Back

Aneesh Pappu, Batu El, Hancheng Cao, Carmelo di Nolfo, Yanchao Sun, Meng Cao, James Zou

PDF

Open Access

TL;DR

This paper investigates the performance of self-organizing multi-agent LLM teams, revealing they often underperform compared to individual experts due to poor expertise leveraging and consensus behaviors, highlighting a gap in emergent coordination.

Contribution

It provides the first systematic analysis of self-organizing LLM teams' ability to leverage expertise, identifying key bottlenecks and behaviors affecting team performance.

Findings

01

LLM teams fail to match expert performance, with up to 37.6% loss.

02

Expert leveraging is the main bottleneck, not identification.

03

Consensus-seeking reduces effective expertise utilization.

Abstract

Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than execute fixed, pre-specified workflows. In such settings, effective coordination cannot be fully designed in advance and must instead emerge through interaction. However, most prior work enforces coordination through fixed roles, workflows, or aggregation rules, leaving open the question of how well self-organizing teams perform when coordination is unconstrained. Drawing on organizational psychology, we study whether self-organizing LLM teams achieve strong synergy, where team performance matches or exceeds the best individual member. Across human-inspired and frontier ML benchmarks, we find that -- unlike human teams -- LLM teams consistently fail to match their expert agent's performance, even when explicitly told who the expert is, incurring performance losses of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Multi-Agent Systems and Negotiation · Language and cultural evolution