Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation

Haoji Zhang; Yuzhe Li; Zhenqiang Liu; Chenyang Liu; Shenyang Zhang; Yi Zhou

arXiv:2601.21469·cs.SE·January 30, 2026

Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation

Haoji Zhang, Yuzhe Li, Zhenqiang Liu, Chenyang Liu, Shenyang Zhang, Yi Zhou

PDF

Open Access

TL;DR

DebateCoder is a multi-agent framework that enhances small language models' reasoning for code generation by using adaptive confidence gating, multi-turn deliberation, and review mechanisms, achieving higher accuracy with less API usage.

Contribution

This work introduces DebateCoder, a novel multi-agent collaborative approach with adaptive confidence gating to improve small language models' reasoning in code generation tasks.

Findings

01

Achieves 70.12% Pass@1 on HumanEval, surpassing MapCoder.

02

Reduces API overhead by approximately 35%.

03

Demonstrates effective mitigation of small model limitations through collaboration.

Abstract

While Large Language Models (LLMs) have catalyzed breakthroughs in automated code generation, Small Language Models (SLMs) often encounter reasoning bottlenecks and failure loops when addressing complex logical requirements. To overcome these challenges, we propose DebateCoder, a multi-agent collaborative framework designed to improve the reasoning ability of SLMs (e.g., Pangu-1B) in resource-constrained environments. DebateCoder uses a structured role-playing protocol with three agents: User Agent (A_UA), Technical Agent (A_TA), and Quality Assurance Agent (A_QA). It also includes an Adaptive Confidence Gating mechanism with a 95% threshold to balance accuracy and inference efficiency. In addition, we introduce a multi-turn deliberation module and a reviewer-guided analytical debugging loop for orthogonal pre-generation debate and post-generation refinement. Experiments on HumanEval…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Software Engineering Techniques and Practices