Budget-Aware Agentic Routing via Boundary-Guided Training

Caiqi Zhang; Menglin Xia; Xuchao Zhang; Daniel Madrigal; Ankur Mallick; Samuel Kessler; Victor Ruehle; Saravan Rajmohan

arXiv:2602.21227·cs.CL·February 26, 2026

Budget-Aware Agentic Routing via Boundary-Guided Training

Caiqi Zhang, Menglin Xia, Xuchao Zhang, Daniel Madrigal, Ankur Mallick, Samuel Kessler, Victor Ruehle, Saravan Rajmohan

PDF

Open Access

TL;DR

This paper introduces a budget-aware agentic routing framework for large language models, optimizing cost and success rate in sequential decision-making under strict budget constraints.

Contribution

It proposes Boundary-Guided Training and Policy Optimization methods to improve cost-efficiency and generalize to strict budget constraints in agentic routing.

Findings

01

Improves efficiency frontier compared to baselines

02

Achieves strong routing performance at lower costs

03

Generalizes well under strict inference-time budgets

Abstract

As large language models (LLMs) evolve into autonomous agents that execute long-horizon workflows, invoking a high-capability model at every step becomes economically unsustainable. While model routing is effective for single-turn queries, agentic routing is a sequential, path-dependent problem: early mistakes compound, feedback is often at the end of the episode, and deployments often demand strict per-task spending limits. We propose Budget-Aware Agentic Routing, which selects between a cheap and an expensive model at each step to optimize the cost--success frontier and to operate under strict per-task budgets. We propose Boundary-Guided Training, which leverages two boundary policies (always-small vs.\ always-large) to build a difficulty taxonomy and to anchor learning under sparse rewards. Our approach warms start with boundary-guided SFT data synthesis via stratified sampling of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Graph Neural Networks · Multimodal Machine Learning Applications