Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning

Rui Jerry Huang; Wendy Liu; Anastasia Miin; Lei Ding

arXiv:2510.18179·cs.MA·October 23, 2025

Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning

Rui Jerry Huang, Wendy Liu, Anastasia Miin, Lei Ding

PDF

Open Access

TL;DR

This paper introduces Adaptive Coopetition (AdCo), a novel inference-time framework where LLM agents use coarse verifier signals to adaptively collaborate or compete, significantly improving multi-agent reasoning robustness without relying on high-performance verifiers.

Contribution

The paper proposes a new adaptive, UCB-based coopetition mechanism for LLM agents that enhances reasoning performance and robustness at inference time, addressing limitations of existing methods.

Findings

01

Achieves 20% relative improvement on mathematical reasoning benchmarks.

02

Maintains robustness and accuracy across different sample sizes and configurations.

03

Leverages model diversity and reasoning trace measures for improved resilience.

Abstract

Inference-time computation is a critical yet challenging paradigm for enhancing the reasoning performance of large language models (LLMs). While existing strategies improve reasoning stability and consistency, they suffer from notable limitations: self-correction often reinforces the model's initial biases, and Multi-Agent Collaboration (MAC) often fails due to the lack of efficient coordination mechanisms, leading to collective errors. Although high-performing verifiers can detect reasoning errors, making them reliable requires substantial training. To address these challenges, we introduce a novel inference-time framework, Adaptive Coopetition (AdCo), in which LLM agents utilize an adaptive, UCB-based "coopetition" mechanism. At each round, agents leverage coarse verifier signals to determine whether to collaborate or compete, and iteratively refine their reasoning based on peer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications