Distilling Knowledge from Reader to Retriever for Question Answering

Gautier Izacard; Edouard Grave

arXiv:2012.04584·cs.CL·August 5, 2022·39 cites

Distilling Knowledge from Reader to Retriever for Question Answering

Gautier Izacard, Edouard Grave

PDF

Open Access 4 Repos 2 Models 1 Video

TL;DR

This paper introduces a knowledge distillation-based method to train retriever models for question answering without needing annotated query-document pairs, leading to improved retrieval performance.

Contribution

It proposes a novel approach that uses reader attention scores to generate synthetic labels for training retrievers, eliminating the need for supervised query-document data.

Findings

01

Achieved state-of-the-art results on question answering tasks.

02

Demonstrated effectiveness of synthetic labels derived from reader attention.

03

Improved retrieval accuracy without annotated training pairs.

Abstract

The task of information retrieval is an important component of many natural language processing systems, such as open domain question answering. While traditional methods were based on hand-crafted features, continuous representations based on neural networks recently obtained competitive results. A challenge of using such methods is to obtain supervised data to train the retriever model, corresponding to pairs of query and support documents. In this paper, we propose a technique to learn retriever models for downstream tasks, inspired by knowledge distillation, and which does not require annotated pairs of query and documents. Our approach leverages attention scores of a reader model, used to solve the task based on retrieved documents, to obtain synthetic labels for the retriever. We evaluate our method on question answering, obtaining state-of-the-art results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

Distilling Knowledge from Reader to Retriever for Question Answering· slideslive

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques