TL;DR
AgentOpt is a Python framework that optimizes client-side resource allocation in multi-step LLM-based agents, significantly reducing costs while maintaining accuracy.
Contribution
It introduces a novel, framework-agnostic approach for client-side optimization, focusing on model selection within multi-step agent pipelines.
Findings
UCB-E algorithm recovers near-optimal accuracy.
Cost gap between models can reach 13-32x.
UCB-E reduces evaluation budget by 62-76%.
Abstract
AI agents are increasingly deployed in real-world applications, including systems such as Manus, OpenClaw, and coding agents. Existing research has primarily focused on server-side efficiency, proposing methods such as caching, speculative execution, traffic scheduling, and load balancing to reduce the cost of serving agentic workloads. However, as users increasingly construct agents by composing local tools, remote APIs, and diverse models, an equally important optimization problem arises on the client side. Client-side optimization asks how developers should allocate the resources available to them, including model choice, local tools, and API budget across pipeline stages, subject to application-specific quality, cost, and latency constraints. Because these objectives depend on the task and deployment setting, they cannot be determined by server-side systems alone. We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
