Multi-task Retrieval for Knowledge-Intensive Tasks

Jean Maillard; Vladimir Karpukhin; Fabio Petroni; Wen-tau Yih; Barlas; O\u{g}uz; Veselin Stoyanov; Gargi Ghosh

arXiv:2101.00117·cs.CL·January 5, 2021

Multi-task Retrieval for Knowledge-Intensive Tasks

Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas, O\u{g}uz, Veselin Stoyanov, Gargi Ghosh

PDF

1 Datasets

TL;DR

This paper introduces a multi-task neural retrieval model that is robust across various knowledge-intensive tasks, outperforming previous methods especially in few-shot scenarios and matching state-of-the-art results.

Contribution

The paper presents a universal multi-task trained neural retrieval model that improves robustness and performance across diverse tasks and data regimes.

Findings

01

Outperforms previous retrieval methods in few-shot settings

02

Rivals specialized neural retrievers with abundant in-domain data

03

Enhances downstream task models and achieves state-of-the-art results

Abstract

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking. Although neural retrieval outperforms traditional methods like tf-idf and BM25, its performance degrades considerably when applied to out-of-domain data. Driven by the question of whether a neural retrieval model can be universal and perform robustly on a wide variety of problems, we propose a multi-task trained model. Our approach not only outperforms previous methods in the few-shot setting, but also rivals specialised neural retrievers, even when in-domain training data is abundant. With the help of our retriever, we improve existing models for downstream tasks and closely match or improve the state of the art on multiple benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

corag/kilt-corpus
dataset· 257 dl
257 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.