Agentic Systems as Boosting Weak Reasoning Models
Varun Sunkaraneni, Pierfrancesco Beneventano, Riccardo Neumarker, Tomaso Poggio, Tomer Galanti

TL;DR
This paper investigates how committees of weak reasoning models can be boosted at inference time to match stronger models, emphasizing the importance of local signals and selection strategies.
Contribution
It formalizes the mechanisms of inference-time boosting, analyzes the limits of proposal coverage and local signals, and empirically demonstrates significant performance gains with committee methods.
Findings
Committee boosting improves task success from 67.0% to 76.4%.
Local signals like execution or proof checking are essential for reliable amplification.
Coverage limitations indicate shared blind spots in weak models.
Abstract
Can a committee of weak reasoning-model calls reach the performance of much stronger models? We study verifier-backed committee search as inference-time boosting for reasoning language models. The mechanism is not simply that ``more agents help'': samples expose latent correct solutions, while critics and comparators must recover them without access to the hidden verifier. We formalize this view by separating proposal coverage, local identifiability, progress, and diversity. We prove that coverage can be amplified by repeated sampling, but cannot by itself create useful critics or comparators; reliable amplification requires an additional local soundness signal, such as execution, proof checking, type checking, tests, or constraint solving. We give rank-based bounds showing when local selection errors compose into reliable trajectories, and characterize the proposer-side ceiling: oracle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
