Learning Agent Routing From Early Experience
Yimin Wang, Jiahao Qiu, Xuan Qi, Xinzhe Juan, Jingzhe Shi, Zelin Zhao, Hongru Wang, Shilong Liu, Mengdi Wang

TL;DR
BoundaryRouter is a training-free framework that efficiently routes queries between lightweight LLM inference and full agent execution, significantly reducing latency and improving accuracy using early experience and rubric-guided reasoning.
Contribution
The paper introduces BoundaryRouter, a novel routing method that leverages early behavioral experience for effective decision-making without additional training.
Findings
BoundaryRouter reduces inference time by 60.6%.
It improves performance by 28.6% over direct LLM inference.
Outperforms prompt-based and retrieval-only routing methods.
Abstract
LLM agents achieve strong performance on complex reasoning tasks but incur high latency and compute cost. In practice, many queries fall within the capability boundary of cutting-edge LLMs and do not require full agent execution, making effective routing between LLMs and agents a key challenge. We study the problem of routing queries between lightweight LLM inference and full agent execution under realistic cold-start settings. To address this, we propose BoundaryRouter, a training-free routing framework that uses early behavioral experience and rubric-guided reasoning to decide whether to answer a query with direct LLM inference or escalate to an agent. BoundaryRouter builds a compact experience memory by executing both systems on a shared seed set and retrieves similar cases at inference time to guide routing decisions. To evaluate this method, we introduce RouteBench, a benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
