TL;DR
This paper introduces the Globally Normalized Reader, a scalable and efficient extractive question answering model that reformulates the task as an iterative search, achieving high performance and speed on SQuAD.
Contribution
It proposes a novel globally normalized, iterative search approach for extractive QA, improving scalability and efficiency over previous bi-attention models.
Findings
Achieves 68.4 EM and 76.21 F1 on SQuAD dev set.
Runs 24.7x faster than bi-attention-flow models.
Data augmentation with entity swapping enhances model performance.
Abstract
Rapid progress has been made towards question answering (QA) systems that can extract answers from text. Existing neural approaches make use of expensive bi-directional attention mechanisms or score all possible answer spans, limiting scalability. We propose instead to cast extractive QA as an iterative search problem: select the answer's sentence, start word, and end word. This representation reduces the space of each search step and allows computation to be conditionally allocated to promising search paths. We show that globally normalizing the decision process and back-propagating through beam search makes this representation viable and learning efficient. We empirically demonstrate the benefits of this approach using our model, Globally Normalized Reader (GNR), which achieves the second highest single model performance on the Stanford Question Answering Dataset (68.4 EM, 76.21 F1…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
