Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension
Minghao Hu, Yuxing Peng, Zhen Huang, Dongsheng Li

TL;DR
This paper introduces RE$^3$QA, an end-to-end unified model for multi-document reading comprehension that improves efficiency and performance by sharing representations and training components jointly, outperforming pipelined systems.
Contribution
The paper proposes a novel end-to-end model that unifies retrieval, reading, and reranking, enabling joint training and better utilization of upstream outputs for multi-document QA.
Findings
Outperforms pipelined baselines on TriviaQA and SQuAD datasets.
Achieves state-of-the-art results in multi-document reading comprehension.
Demonstrates improved efficiency through shared contextualized representations.
Abstract
This paper considers the reading comprehension task in which multiple documents are given as input. Prior work has shown that a pipeline of retriever, reader, and reranker can improve the overall performance. However, the pipeline system is inefficient since the input is re-encoded within each module, and is unable to leverage upstream components to help downstream training. In this work, we present REQA, a unified question answering model that combines context retrieving, reading comprehension, and answer reranking to predict the final answer. Unlike previous pipelined approaches, REQA shares contextualized text representation across different components, and is carefully designed to use high-quality upstream outputs (e.g., retrieved context or candidate answers) for directly supervising downstream modules (e.g., the reader or the reranker). As a result, the whole network can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
