Tracking the World State with Recurrent Entity Networks

Mikael Henaff; Jason Weston; Arthur Szlam; Antoine Bordes; Yann; LeCun

arXiv:1612.03969·cs.CL·May 11, 2017·157 cites

Tracking the World State with Recurrent Entity Networks

Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann, LeCun

PDF

Open Access 5 Repos

TL;DR

The paper introduces the Recurrent Entity Network (EntNet), a memory-augmented model capable of dynamic reasoning and maintaining world state, achieving state-of-the-art results on reasoning tasks and demonstrating strong generalization.

Contribution

It presents a novel recurrent memory architecture that updates multiple memory locations simultaneously, enabling on-the-fly reasoning and improved performance on complex language understanding tasks.

Findings

01

Sets new state-of-the-art on bAbI tasks

02

Solves reasoning tasks with many supporting facts

03

Performs competitively on large-scale datasets like Children's Book Test

Abstract

We introduce a new model, the Recurrent Entity Network (EntNet). It is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond as is the case for a Memory Network (Sukhbaatar et al., 2015). Like a Neural Turing Machine or Differentiable Neural Computer (Graves et al., 2014; 2016) it maintains a fixed size memory and can learn to perform location and content-based read and write operations. However, unlike those models it has a simple parallel architecture in which several memory locations can be updated simultaneously. The EntNet sets a new state-of-the-art on the bAbI tasks, and is the first method to solve all the tasks in the 10k training examples setting.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

MethodsSoftmax · Sigmoid Activation · Tanh Activation · Neural Turing Machine · Recurrent Entity Network · Location-based Attention · Content-based Attention · Long Short-Term Memory