Budget-Aware Tool-Use Enables Effective Agent Scaling
Tengxiao Liu, Zifeng Wang, Jin Miao, I-Hung Hsu, Jun Yan, Jiefeng Chen, Rujun Han, Fangyuan Xu, Yanfei Chen, Ke Jiang, Samira Daruki, Yi Liang, William Yang Wang, Tomas Pfister, Chen-Yu Lee

TL;DR
This paper introduces a budget-aware framework for scaling tool-augmented language agents, improving their efficiency and performance by dynamically managing tool call budgets during task execution.
Contribution
It proposes the Budget Tracker and BATS framework, enabling agents to adapt their planning based on explicit tool-call budgets, a novel approach for cost-effective agent scaling.
Findings
Budget-aware methods achieve better cost-performance scaling.
Agents with budget awareness reach higher performance levels.
The formalized cost metric aids in systematic analysis of scaling strategies.
Abstract
Scaling test-time computation improves performance across different tasks on large language models (LLMs), which has also been extended to tool-augmented agents. For these agents, scaling involves not only "thinking" in tokens but also "acting" via tool calls. The number of tool calls directly bounds the agent's interaction with the external environment. However, we find that simply granting agents a larger tool-call budget fails to improve performance, as they lack "budget awareness" and quickly hit a performance ceiling. To address this, we study how to scale such agents effectively under explicit tool-call budgets, focusing on web search agents. We first introduce the Budget Tracker, a lightweight plug-in that provides the agent with continuous budget awareness, enabling simple yet effective scaling. We further develop BATS (Budget Aware Test-time Scaling), an advanced framework that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Mobile Crowdsensing and Crowdsourcing · Natural Language Processing Techniques
