Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling

Xinglin Wang; Yiwei Li; Shaoxiong Feng; Peiwen Yuan; Yueqi Zhang; Jiayi Shi; Chuyi Tan; Boyuan Pan; Yao Hu; Kan Li

arXiv:2506.15707·cs.LG·October 21, 2025

Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling

Xinglin Wang, Yiwei Li, Shaoxiong Feng, Peiwen Yuan, Yueqi Zhang, Jiayi Shi, Chuyi Tan, Boyuan Pan, Yao Hu, Kan Li

PDF

Open Access

TL;DR

This paper introduces DORA, a novel resource allocation method for test-time scaling of large language models, optimizing compute use during search to improve accuracy on mathematical reasoning benchmarks.

Contribution

The paper formulates test-time search as a resource allocation problem and proposes DORA, the first provably optimal method that decouples direction quality from candidate count for efficient compute use.

Findings

01

DORA outperforms baselines on mathematical reasoning benchmarks.

02

DORA achieves state-of-the-art accuracy with comparable computational cost.

03

Optimal resource allocation improves test-time reasoning efficiency.

Abstract

Test-Time Scaling (TTS) improves the performance of Large Language Models (LLMs) by using additional inference-time computation to explore multiple reasoning paths through search. Yet how to allocate a fixed rollout budget most effectively during search remains underexplored, often resulting in inefficient use of compute at test time. To bridge this gap, we formulate test-time search as a resource allocation problem and derive the optimal allocation strategy that maximizes the probability of obtaining a correct solution under a fixed rollout budget. Within this formulation, we reveal a core limitation of existing search methods: solution-level allocation tends to favor reasoning directions with more candidates, leading to theoretically suboptimal and inefficient use of compute. To address this, we propose Direction-Oriented Resource Allocation (DORA), a provably optimal method that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications