Teaching Machines to Read and Comprehend
Karl Moritz Hermann, Tom\'a\v{s} Ko\v{c}isk\'y, Edward Grefenstette,, Lasse Espeholt, Will Kay, Mustafa Suleyman, Phil Blunsom

TL;DR
This paper introduces a new methodology for creating large-scale supervised reading comprehension datasets, enabling the development of neural networks that can read and answer questions about real documents with minimal prior language knowledge.
Contribution
It presents a novel methodology for generating large-scale datasets and develops attention-based neural networks capable of understanding and answering questions on real documents.
Findings
Successful training of neural networks on new large-scale datasets
Neural models can answer complex questions with minimal language prior knowledge
Improved performance over previous methods in machine reading comprehension
Abstract
Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large scale training and test datasets have been missing for this type of evaluation. In this work we define a new methodology that resolves this bottleneck and provides large scale supervised reading comprehension data. This allows us to develop a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsConcatenated Skip Connection · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Dropout · RMSProp · Deep LSTM Reader
