The impact of multi-agent debate protocols on debate quality: a controlled case study
Ramtin Zargari Marandi

TL;DR
This study evaluates how different multi-agent debate protocols influence debate quality, showing that protocol design significantly affects convergence speed, peer referencing, and argument diversity in a macroeconomic case study.
Contribution
It introduces and compares three debate protocols, including a novel Rank-Adaptive Cross-Round method, demonstrating their effects on debate dynamics and outcomes.
Findings
RA-CR achieves faster convergence than CR
WR shows higher peer-referencing
NI maximizes Argument Diversity
Abstract
In multi-agent debate (MAD) systems, performance gains are often reported; however, because the debate protocol (e.g., number of agents, rounds, and aggregation rule) is typically held fixed while model-related factors vary, it is difficult to disentangle protocol effects from model effects. To isolate these effects, we compare three main protocols, Within-Round (WR; agents see only current-round contributions), Cross-Round (CR; full prior-round context), and novel Rank-Adaptive Cross-Round (RA-CR; dynamically reorders agents and silences one per round via an external judge model), against a No-Interaction baseline (NI; independent responses without peer visibility). In a controlled macroeconomic case study (20 diverse events, five random seeds, matched prompts/decoding), RA-CR achieves faster convergence than CR, WR shows higher peer-referencing, and NI maximizes Argument Diversity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
