Towards Universal Dense Retrieval for Open-domain Question Answering
Christopher Sciavolino

TL;DR
This paper investigates the limitations of dense retrieval models in open-domain question answering, especially their poor generalization to unseen question distributions, and proposes new training techniques to enhance out-of-domain performance.
Contribution
It introduces an entity-rich QA dataset from Wikidata, analyzes dense models' generalization issues, and proposes training methods to improve their out-of-domain effectiveness.
Findings
Dense models struggle to generalize to unseen question distributions.
New training techniques can improve out-of-domain retrieval performance.
The paper advocates for developing a universal dense retrieval model.
Abstract
In open-domain question answering, a model receives a text question as input and searches for the correct answer using a large evidence corpus. The retrieval step is especially difficult as typical evidence corpora have \textit{millions} of documents, each of which may or may not have the correct answer to the question. Very recently, dense models have replaced sparse methods as the de facto retrieval method. Rather than focusing on lexical overlap to determine similarity, dense methods build an encoding function that captures semantic similarity by learning from a small collection of question-answer or question-context pairs. In this paper, we investigate dense retrieval models in the context of open-domain question answering across different input distributions. To do this, first we introduce an entity-rich question answering dataset constructed from Wikidata facts and demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
