Unsupervised Query Routing for Retrieval Augmented Generation
Feiteng Mu, Liwen Zhang, Yong Jiang, Wenjie Li, Zhen Zhang, Pengjun, Xie, Fei Huang

TL;DR
This paper presents an unsupervised query routing method for retrieval-augmented generation that improves scalability and generalization by eliminating manual annotations and using an upper-bound response for evaluation.
Contribution
The paper introduces a novel unsupervised approach for query routing that constructs upper-bound responses to evaluate retrieval quality without manual labels.
Findings
Significantly improves scalability in query routing.
Enhances generalization to out-of-distribution queries.
Demonstrates strong performance across five datasets.
Abstract
Query routing for retrieval-augmented generation aims to assign an input query to the most suitable search engine. Existing works rely heavily on supervised datasets that require extensive manual annotation, resulting in high costs and limited scalability, as well as poor generalization to out-of-distribution scenarios. To address these challenges, we introduce a novel unsupervised method that constructs the "upper-bound" response to evaluate the quality of retrieval-augmented responses. This evaluation enables the decision of the most suitable search engine for a given query. By eliminating manual annotations, our approach can automatically process large-scale real user queries and create training data. We conduct extensive experiments across five datasets, demonstrating that our method significantly enhances scalability and generalization capabilities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Neural Networks and Applications · Machine Learning and ELM
