EvoRoute: Experience-Driven Self-Routing LLM Agent Systems

Guibin Zhang; Haiyang Yu; Kaiming Yang; Bingli Wu; Fei Huang; Yongbin Li; Shuicheng Yan

arXiv:2601.02695·cs.CL·January 7, 2026

EvoRoute: Experience-Driven Self-Routing LLM Agent Systems

Guibin Zhang, Haiyang Yu, Kaiming Yang, Bingli Wu, Fei Huang, Yongbin Li, Shuicheng Yan

PDF

Open Access

TL;DR

EvoRoute introduces an adaptive routing system for LLM-based agents that dynamically balances performance, cost, and speed by learning from experience, significantly reducing expenses and latency while maintaining high task effectiveness.

Contribution

It presents EvoRoute, a novel self-evolving model routing approach that optimizes LLM selection in real-time based on environment feedback, surpassing static routing methods.

Findings

01

Reduces execution cost by up to 80%

02

Lowers latency by over 70%

03

Maintains or improves system performance

Abstract

Complex agentic AI systems, powered by a coordinated ensemble of Large Language Models (LLMs), tool and memory modules, have demonstrated remarkable capabilities on intricate, multi-turn tasks. However, this success is shadowed by prohibitive economic costs and severe latency, exposing a critical, yet underexplored, trade-off. We formalize this challenge as the \textbf{Agent System Trilemma}: the inherent tension among achieving state-of-the-art performance, minimizing monetary cost, and ensuring rapid task completion. To dismantle this trilemma, we introduce EvoRoute, a self-evolving model routing paradigm that transcends static, pre-defined model assignments. Leveraging an ever-expanding knowledge base of prior experience, EvoRoute dynamically selects Pareto-optimal LLM backbones at each step, balancing accuracy, efficiency, and resource use, while continually refining its own…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Big Data and Digital Economy · Explainable Artificial Intelligence (XAI)