Don't Overthink It: Inter-Rollout Action Agreement as a Free Adaptive-Compute Signal for LLM Agents
Khushal Sethi

TL;DR
TrACE is a training-free adaptive compute controller for LLM agents that allocates computational resources based on inter-rollout action agreement, improving efficiency without sacrificing accuracy.
Contribution
Introduces TrACE, a novel, training-free method for adaptive compute allocation in LLMs based on action agreement, validated on reasoning and navigation benchmarks.
Findings
TrACE reduces LLM call usage by up to 65% while maintaining accuracy.
Inter-rollout agreement reliably signals step difficulty and success.
TrACE outperforms fixed-budget self-consistency methods in efficiency.
Abstract
Inference-time compute scaling has emerged as a powerful technique for improving the reliability of large language model (LLM) agents, but existing methods apply compute uniformly: every decision step receives the same budget regardless of its difficulty. We introduce TrACE (Trajectorical Adaptive Compute via agrEement), a training-free controller that allocates LLM calls adaptively across agent timesteps by measuring inter-rollout action agreement. At each step, TrACE samples a small set of candidate next actions and measures how consistently the model commits to the same action. High agreement signals an easy decision; the controller commits immediately. Low agreement signals uncertainty; the controller samples additional rollouts up to a configurable cap before committing to the plurality action. No learned components, no external verifier, and no human labels are required. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
