Lightweight Query Routing for Adaptive RAG: A Baseline Study on RAGRouter-Bench

Prakhar Bansal; Shivangi Agarwal

arXiv:2604.03455·cs.IR·April 7, 2026

Lightweight Query Routing for Adaptive RAG: A Baseline Study on RAGRouter-Bench

Prakhar Bansal, Shivangi Agarwal

PDF

TL;DR

This study evaluates lightweight classifier-based routing strategies for Retrieval-Augmented Generation, demonstrating high accuracy and significant token savings on the RAGRouter-Bench benchmark across various domains.

Contribution

First systematic evaluation of lightweight classifiers for query routing in RAG, establishing a strong baseline and analyzing feature effectiveness and domain-specific challenges.

Findings

01

TF-IDF with SVM achieves 93.2% accuracy and 28.1% token savings.

02

Lexical features outperform semantic embeddings in routing accuracy.

03

Medical queries are more challenging to route than legal queries.

Abstract

Retrieval-Augmented Generation pipelines span a wide range of retrieval strategies that differ substantially in token cost and capability. Selecting the right strategy per query is a practical efficiency problem, yet no routing classifiers have been trained on RAGRouter-Bench \citep{wang2026ragrouterbench}, a recently released benchmark of $7, 727$ queries spanning four knowledge domains, each annotated with one of three canonical query types: factual, reasoning, and summarization. We present the first systematic evaluation of lightweight classifier-based routing on this benchmark. Five classical classifiers are evaluated under three feature regimes, namely, TF-IDF, MiniLM sentence embeddings \citep{reimers2019sbert}, and hand-crafted structural features, yielding 15 classifier feature combinations. Our best configuration, TF-IDF with an SVM, achieves a macro-averaged F1 of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.