Learning Agent Routing From Early Experience

Yimin Wang; Jiahao Qiu; Xuan Qi; Xinzhe Juan; Jingzhe Shi; Zelin Zhao; Hongru Wang; Shilong Liu; Mengdi Wang

arXiv:2605.07180·cs.CL·May 11, 2026

Learning Agent Routing From Early Experience

Yimin Wang, Jiahao Qiu, Xuan Qi, Xinzhe Juan, Jingzhe Shi, Zelin Zhao, Hongru Wang, Shilong Liu, Mengdi Wang

PDF

TL;DR

BoundaryRouter is a training-free framework that efficiently routes queries between lightweight LLM inference and full agent execution, significantly reducing latency and improving accuracy using early experience and rubric-guided reasoning.

Contribution

The paper introduces BoundaryRouter, a novel routing method that leverages early behavioral experience for effective decision-making without additional training.

Findings

01

BoundaryRouter reduces inference time by 60.6%.

02

It improves performance by 28.6% over direct LLM inference.

03

Outperforms prompt-based and retrieval-only routing methods.

Abstract

LLM agents achieve strong performance on complex reasoning tasks but incur high latency and compute cost. In practice, many queries fall within the capability boundary of cutting-edge LLMs and do not require full agent execution, making effective routing between LLMs and agents a key challenge. We study the problem of routing queries between lightweight LLM inference and full agent execution under realistic cold-start settings. To address this, we propose BoundaryRouter, a training-free routing framework that uses early behavioral experience and rubric-guided reasoning to decide whether to answer a query with direct LLM inference or escalate to an agent. BoundaryRouter builds a compact experience memory by executing both systems on a shared seed set and retrieves similar cases at inference time to guide routing decisions. To evaluate this method, we introduce RouteBench, a benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.