R$^3$AG: Retriever Routing for Retrieval-Augmented Generation

Tong Zhao; Yutao Zhu; Yucheng Tian; Zhicheng Dou

arXiv:2604.22849·cs.IR·April 28, 2026

R$^3$AG: Retriever Routing for Retrieval-Augmented Generation

Tong Zhao, Yutao Zhu, Yucheng Tian, Zhicheng Dou

PDF

TL;DR

R$^3$AG introduces a dynamic routing framework for retrieval-augmented generation that models query-specific preferences by decomposing retriever capabilities into retrieval quality and generation utility, leading to improved task performance.

Contribution

It proposes a novel dynamic routing method that explicitly models and learns query-specific retriever preferences, surpassing static routing approaches.

Findings

01

R$^3$AG outperforms static routing methods on multiple knowledge-intensive tasks.

02

Decomposing retriever capabilities improves retrieval relevance and answer correctness.

03

Contrastive learning effectively captures query-specific preference shifts.

Abstract

Retrieval-augmented generation (RAG) has become a cornerstone for knowledge-intensive tasks. However, the efficacy of RAG is often bottlenecked by the ``one-size-fits-all'' retrieval paradigm, as different queries exhibit distinct preferences for different retrievers. While recent routing techniques attempt to select the optimal retriever dynamically, they typically operate under a ``single and static capability'' assumption, selecting retrievers solely based on semantic relevance. This overlooks a critical distinction in RAG: a retrieved document must not only be relevant but also effectively support the generator in producing correct answers. To address this limitation, we propose R $^{3}$ AG, a novel routing framework that explicitly models the dynamic alignment between queries and retriever capabilities. Unlike previous approaches, R $^{3}$ AG decomposes retriever capability into two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.