RouterRetriever: Routing over a Mixture of Expert Embedding Models
Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo

TL;DR
RouterRetriever introduces a routing-based retrieval model that leverages domain-specific experts, outperforming traditional single-model approaches on diverse datasets by selecting the most relevant expert for each query.
Contribution
This work pioneers the use of routing over a mixture of domain-specific expert embedding models in retrieval, demonstrating improved performance and flexibility.
Findings
Outperforms MSMARCO-trained models by +2.1 nDCG@10
Surpasses multi-task models by +3.2 nDCG@10
Routing mechanism outperforms other techniques by +1.8 on average
Abstract
Information retrieval methods often rely on a single embedding model trained on large, general-domain datasets like MSMARCO. While this approach can produce a retriever with reasonable overall performance, they often underperform models trained on domain-specific data when testing on their respective domains. Prior work in information retrieval has tackled this through multi-task training, but the idea of routing over a mixture of domain-specific expert retrievers remains unexplored despite the popularity of such ideas in language model generation research. In this work, we introduce RouterRetriever, a retrieval model that leverages a mixture of domain-specific experts by using a routing mechanism to select the most appropriate expert for each query. RouterRetriever is lightweight and allows easy addition or removal of experts without additional training. Evaluation on the BEIR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComplex Network Analysis Techniques · Expert finding and Q&A systems · Data-Driven Disease Surveillance
