MoR: Better Handling Diverse Queries with a Mixture of Sparse, Dense, and Human Retrievers
Jushaan Singh Kalra, Xinran Zhao, To Eun Kim, Fengyu Cai, Fernando Diaz, Tongshuang Wu

TL;DR
This paper introduces a mixture of retrievers approach that dynamically combines sparse, dense, and human sources to improve retrieval-augmented generation, outperforming individual retrievers and larger models.
Contribution
It proposes a zero-shot, weighted mixture of heterogeneous retrievers that adaptively integrates multiple sources for better information retrieval in RAG systems.
Findings
Mixture of retrievers outperforms individual retrievers and larger models.
The approach achieves +10.8% and +3.9% improvements over single retrievers and 7B models.
Incorporating human sources yields a 58.9% relative performance boost.
Abstract
Retrieval-augmented Generation (RAG) is powerful, but its effectiveness hinges on which retrievers we use and how. Different retrievers offer distinct, often complementary signals: BM25 captures lexical matches; dense retrievers, semantic similarity. Yet in practice, we typically fix a single retriever based on heuristics, which fails to generalize across diverse information needs. Can we dynamically select and integrate multiple retrievers for each individual query, without the need for manual selection? In our work, we validate this intuition with quantitative analysis and introduce mixture of retrievers: a zero-shot, weighted combination of heterogeneous retrievers. Extensive experiments show that such mixtures are effective and efficient: Despite totaling just 0.8B parameters, this mixture outperforms every individual retriever and even larger 7B models by +10.8% and +3.9% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsInformation Retrieval and Search Behavior · Multimodal Machine Learning Applications · Topic Modeling
