The NarrativeQA Reading Comprehension Challenge

Tom\'a\v{s} Ko\v{c}isk\'y; Jonathan Schwarz; Phil Blunsom; Chris Dyer,; Karl Moritz Hermann; G\'abor Melis; Edward Grefenstette

arXiv:1712.07040·cs.CL·December 20, 2017

The NarrativeQA Reading Comprehension Challenge

Tom\'a\v{s} Ko\v{c}isk\'y, Jonathan Schwarz, Phil Blunsom, Chris Dyer,, Karl Moritz Hermann, G\'abor Melis, Edward Grefenstette

PDF

2 Repos 2 Models 5 Datasets

TL;DR

The paper introduces the NarrativeQA dataset and tasks to evaluate deep reading comprehension, emphasizing understanding of narratives over superficial pattern matching, and demonstrates the difficulty models face on these tasks.

Contribution

It presents a new dataset and tasks for narrative comprehension that require understanding entire stories, addressing limitations of existing RC datasets.

Findings

01

Humans solve the tasks easily.

02

Standard RC models struggle with the tasks.

03

The dataset encourages development of models with deeper understanding.

Abstract

Reading comprehension (RC)---in contrast to information retrieval---requires integrating information and reasoning about events, entities, and their relations across a full document. Question answering is conventionally used to assess RC ability, in both artificial agents and children learning to read. However, existing RC datasets and tasks are dominated by questions that can be solved by selecting answers using superficial information (e.g., local context similarity or global term frequency); they thus fail to test for the essential integrative aspect of RC. To encourage progress on deeper comprehension of language, we present a new dataset and set of tasks in which the reader must answer questions about stories by reading entire books or movie scripts. These tasks are designed so that successfully answering their questions requires understanding the underlying narrative rather than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.