Learning Recurrent Span Representations for Extractive Question   Answering

Kenton Lee; Shimi Salant; Tom Kwiatkowski; Ankur Parikh; Dipanjan Das,; Jonathan Berant

arXiv:1611.01436·cs.CL·March 21, 2017·127 cites

Learning Recurrent Span Representations for Extractive Question Answering

Kenton Lee, Shimi Salant, Tom Kwiatkowski, Ankur Parikh, Dipanjan Das,, Jonathan Berant

PDF

Open Access 2 Repos

TL;DR

This paper introduces a recurrent span representation model for extractive question answering, significantly improving accuracy on the SQuAD dataset by directly modeling answer spans.

Contribution

The paper presents a novel recurrent architecture for fixed-length span representations that enhances extractive QA performance over previous methods.

Findings

01

Improves SQuAD answer extraction accuracy by 5%.

02

Reduces baseline error by over 50%.

03

Demonstrates the effectiveness of explicit span modeling.

Abstract

The reading comprehension task, that asks questions about a given evidence document, is a central problem in natural language understanding. Recent formulations of this task have typically focused on answer selection from a set of candidates pre-defined manually or through the use of an external NLP pipeline. However, Rajpurkar et al. (2016) recently released the SQuAD dataset in which the answers can be arbitrary strings from the supplied text. In this paper, we focus on this answer extraction task, presenting a novel model architecture that efficiently builds fixed length representations of all spans in the evidence document with a recurrent network. We show that scoring explicit span representations significantly improves performance over other approaches that factor the prediction into separate predictions about words or start and end markers. Our approach improves upon the best…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques