Low-Resource Dense Retrieval for Open-Domain Question Answering: A Comprehensive Survey
Xiaoyu Shen, Svitlana Vakulenko, Marco del Tredici, Gianni Barlacchi,, Bill Byrne, Adri\`a de Gispert

TL;DR
This survey reviews low-resource dense retrieval techniques for open-domain question answering, categorizing methods based on resource requirements and discussing their advantages, challenges, and future directions.
Contribution
It provides a comprehensive structured overview of diverse low-resource dense retrieval methods, aiding in selecting suitable techniques for specific scenarios.
Findings
Techniques categorized by resource needs: documents only, documents and questions, documents and QA pairs.
Analysis of open issues, pros, and cons for each method.
Outlines promising future research directions.
Abstract
Dense retrieval (DR) approaches based on powerful pre-trained language models (PLMs) achieved significant advances and have become a key component for modern open-domain question-answering systems. However, they require large amounts of manual annotations to perform competitively, which is infeasible to scale. To address this, a growing body of research works have recently focused on improving DR performance under low-resource scenarios. These works differ in what resources they require for training and employ a diverse set of techniques. Understanding such differences is crucial for choosing the right technique under a specific low-resource scenario. To facilitate this understanding, we provide a thorough structured overview of mainstream techniques for low-resource DR. Based on their required resources, we divide the techniques into three main categories: (1) only documents are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
