Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James Glass

TL;DR
This paper introduces EAR, a method combining query expansion and reranking to improve passage retrieval for open-domain question answering, significantly boosting accuracy over traditional and dense retrieval methods.
Contribution
The paper presents a novel EAR approach that trains a reranker to select optimal expanded queries, enhancing retrieval performance in open-domain QA tasks.
Findings
EAR improves top-5/20 accuracy by 3-8 and 5-10 points in in-domain and out-of-domain settings.
EAR outperforms vanilla query expansion (GAR) and dense retrieval (DPR) methods.
Connecting query expansion with reranking significantly enhances retrieval results.
Abstract
We propose EAR, a query Expansion And Reranking approach for improving passage retrieval, with the application to open-domain question answering. EAR first applies a query expansion model to generate a diverse set of queries, and then uses a query reranker to select the ones that could lead to better retrieval results. Motivated by the observation that the best query expansion often is not picked by greedy decoding, EAR trains its reranker to predict the rank orders of the gold passages when issuing the expanded queries to a given retriever. By connecting better the query expansion model and retriever, EAR significantly enhances a traditional sparse retrieval method, BM25. Empirically, EAR improves top-5/20 accuracy by 3-8 and 5-10 points in in-domain and out-of-domain settings, respectively, when compared to a vanilla query expansion model, GAR, and a dense retrieval model, DPR.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
