Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading   Comprehension

Minghao Hu; Yuxing Peng; Zhen Huang; Dongsheng Li

arXiv:1906.04618·cs.CL·June 12, 2019·5 cites

Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension

Minghao Hu, Yuxing Peng, Zhen Huang, Dongsheng Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces RE$^3$QA, an end-to-end unified model for multi-document reading comprehension that improves efficiency and performance by sharing representations and training components jointly, outperforming pipelined systems.

Contribution

The paper proposes a novel end-to-end model that unifies retrieval, reading, and reranking, enabling joint training and better utilization of upstream outputs for multi-document QA.

Findings

01

Outperforms pipelined baselines on TriviaQA and SQuAD datasets.

02

Achieves state-of-the-art results in multi-document reading comprehension.

03

Demonstrates improved efficiency through shared contextualized representations.

Abstract

This paper considers the reading comprehension task in which multiple documents are given as input. Prior work has shown that a pipeline of retriever, reader, and reranker can improve the overall performance. However, the pipeline system is inefficient since the input is re-encoded within each module, and is unable to leverage upstream components to help downstream training. In this work, we present RE $^{3}$ QA, a unified question answering model that combines context retrieving, reading comprehension, and answer reranking to predict the final answer. Unlike previous pipelined approaches, RE $^{3}$ QA shares contextualized text representation across different components, and is carefully designed to use high-quality upstream outputs (e.g., retrieved context or candidate answers) for directly supervising downstream modules (e.g., the reader or the reranker). As a result, the whole network can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huminghao16/RE3QA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications