Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers

Yang Li

arXiv:2505.12601·cs.LG·May 18, 2026

Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers

Yang Li

PDF

TL;DR

This paper demonstrates that a simple kNN approach can outperform complex learned routers for LLM model selection across various tasks, emphasizing the value of simplicity and standardized benchmarks.

Contribution

It introduces a suite of routing benchmarks and shows that kNN-based routing often surpasses state-of-the-art learned methods, challenging the trend towards complex architectures.

Findings

01

kNN matches or outperforms learned routers across tasks

02

Locality in embedding space enables effective simple routing

03

Standardized benchmarks facilitate fair comparison

Abstract

As large language models (LLMs) grow in scale and specialization, routing--selecting the best model for a given input--has become essential for efficient and effective deployment. While recent methods rely on complex learned routing strategies, their dependence on disparate training data and evaluation setups makes comparison and generalization difficult. In this work, we revisit LLM routing through the lens of simplicity. We show that a well-tuned k-Nearest Neighbors (kNN) approach not only matches but often outperforms state-of-the-art learned routers across diverse tasks. To support systematic evaluation, we introduce a suite of standardized routing benchmarks spanning instruction-following, question-answering, and reasoning tasks, as well as the first multi-modal routing dataset involving visual inputs. Our findings reveal that the locality properties of model performance in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Software-Defined Networks and 5G · Advanced Neural Network Applications