When Independent Sampling Outperforms Agentic Reasoning

Yihe Dong,Boris Shigida

arXiv:2605.08478·cs.LG·May 12, 2026

When Independent Sampling Outperforms Agentic Reasoning

Yihe Dong,Boris Shigida

PDF

TL;DR

This study compares agent-based reasoning and independent sampling for competitive programming, finding that independent sampling often outperforms agentic reasoning in accuracy-cost tradeoffs under fixed compute budgets.

Contribution

It demonstrates that independent exploration can surpass agentic reasoning in algorithmic tasks, providing empirical results and a budget-allocation analysis.

Findings

01

k-shot sampling outperforms agent reasoning across models and difficulty levels.

02

Independent sampling achieves better accuracy-cost tradeoffs.

03

A theoretical analysis shows optimal budget allocation minimizes log failure likelihood.

Abstract

We study how to allocate inference-time compute for competitive programming under fixed budgets. Evaluating 216 Codeforces problems across Divisions 1-3, we compare agent-based reasoning with repeated independent sampling (k-shot) as a function of both cost and number of model calls. Across models and difficulty levels, k-shot consistently achieves a better accuracy-cost and accuracy-query tradeoff. This gap persists despite prompt caching in agent frameworks, indicating lower per-call effectiveness. Our results show that, for self-contained algorithmic tasks, independent exploration can outperform deeper agentic reasoning under realistic resource constraints. We also provide a budget-allocation analysis when the inference budget is fixed, and prove that a cost-optimal solver minimizes the principled metric log failure likelihood per dollar.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.