Large-scale Simple Question Answering with Memory Networks

Antoine Bordes; Nicolas Usunier; Sumit Chopra; Jason Weston

arXiv:1506.02075·cs.LG·June 9, 2015·564 cites

Large-scale Simple Question Answering with Memory Networks

Antoine Bordes, Nicolas Usunier, Sumit Chopra, Jason Weston

PDF

Open Access 3 Repos 1 Datasets

TL;DR

This paper explores how multitask and transfer learning can improve large-scale simple question answering systems using Memory Networks, introducing a new dataset and demonstrating successful training for high performance.

Contribution

It introduces a new large-scale dataset for simple question answering and demonstrates the effectiveness of multitask and transfer learning within Memory Networks.

Findings

01

Memory Networks can be effectively trained for large-scale QA.

02

Multitask and transfer learning improve performance.

03

A new dataset of 100k questions supports large-scale evaluation.

Abstract

Training large-scale question answering systems is complicated because training sources usually cover a small portion of the range of possible questions. This paper studies the impact of multitask and transfer learning for simple question answering; a setting for which the reasoning required to answer is quite easy, as long as one can retrieve the correct evidence given a question, which can be difficult in large-scale conditions. To this end, we introduce a new dataset of 100k questions that we use in conjunction with existing benchmarks. We conduct our study within the framework of Memory Networks (Weston et al., 2015) because this perspective allows us to eventually scale up to more complex reasoning, and show that Memory Networks can be successfully trained to achieve excellent performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Orange/simplequestions-sparqltotext
dataset· 65 dl
65 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications