Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning
Rui Jerry Huang, Wendy Liu, Anastasia Miin, Lei Ding

TL;DR
This paper introduces Adaptive Coopetition (AdCo), a novel inference-time framework where LLM agents use coarse verifier signals to adaptively collaborate or compete, significantly improving multi-agent reasoning robustness without relying on high-performance verifiers.
Contribution
The paper proposes a new adaptive, UCB-based coopetition mechanism for LLM agents that enhances reasoning performance and robustness at inference time, addressing limitations of existing methods.
Findings
Achieves 20% relative improvement on mathematical reasoning benchmarks.
Maintains robustness and accuracy across different sample sizes and configurations.
Leverages model diversity and reasoning trace measures for improved resilience.
Abstract
Inference-time computation is a critical yet challenging paradigm for enhancing the reasoning performance of large language models (LLMs). While existing strategies improve reasoning stability and consistency, they suffer from notable limitations: self-correction often reinforces the model's initial biases, and Multi-Agent Collaboration (MAC) often fails due to the lack of efficient coordination mechanisms, leading to collective errors. Although high-performing verifiers can detect reasoning errors, making them reliable requires substantial training. To address these challenges, we introduce a novel inference-time framework, Adaptive Coopetition (AdCo), in which LLM agents utilize an adaptive, UCB-based "coopetition" mechanism. At each round, agents leverage coarse verifier signals to determine whether to collaborate or compete, and iteratively refine their reasoning based on peer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications
