The Value of Variance: Mitigating Debate Collapse in Multi-Agent Systems via Uncertainty-Driven Policy Optimization

Luoxi Tang; Yuqiao Meng; Joseph Costa; Yingxue Zhang; Muchao Ye; Zhaohan Xi

arXiv:2602.07186·cs.MA·February 10, 2026

The Value of Variance: Mitigating Debate Collapse in Multi-Agent Systems via Uncertainty-Driven Policy Optimization

Luoxi Tang, Yuqiao Meng, Joseph Costa, Yingxue Zhang, Muchao Ye, Zhaohan Xi

PDF

Open Access

TL;DR

This paper introduces a hierarchical uncertainty metric for multi-agent debate systems, enabling detection of failures and proposing an uncertainty-driven policy to improve decision accuracy and system reliability.

Contribution

It develops a novel hierarchical uncertainty quantification method and an uncertainty-driven policy optimization to mitigate debate collapse in multi-agent systems.

Findings

01

Uncertainty metrics reliably indicate system failures.

02

Mitigation improves decision accuracy.

03

Reduces system disagreement.

Abstract

Multi-agent debate (MAD) systems improve LLM reasoning through iterative deliberation, but remain vulnerable to debate collapse, a failure type where final agent decisions are compromised on erroneous reasoning. Existing methods lack principled mechanisms to detect or prevent such failures. To address this gap, we first propose a hierarchical metric that quantifies behavioral uncertainty at three levels: intra-agent (individual reasoning uncertainty), inter-agent (interactive uncertainty), and system-level (output uncertainty). Empirical analysis across several benchmarks reveals that our proposed uncertainty quantification reliably indicates system failures, which demonstrates the validity of using them as diagnostic metrics to indicate the system failure. Subsequently, we propose a mitigation strategy by formulating an uncertainty-driven policy optimization to penalize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Game Theory and Applications · Reinforcement Learning in Robotics