Tracking the World State with Recurrent Entity Networks
Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann, LeCun

TL;DR
The paper introduces the Recurrent Entity Network (EntNet), a memory-augmented model capable of dynamic reasoning and maintaining world state, achieving state-of-the-art results on reasoning tasks and demonstrating strong generalization.
Contribution
It presents a novel recurrent memory architecture that updates multiple memory locations simultaneously, enabling on-the-fly reasoning and improved performance on complex language understanding tasks.
Findings
Sets new state-of-the-art on bAbI tasks
Solves reasoning tasks with many supporting facts
Performs competitively on large-scale datasets like Children's Book Test
Abstract
We introduce a new model, the Recurrent Entity Network (EntNet). It is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond as is the case for a Memory Network (Sukhbaatar et al., 2015). Like a Neural Turing Machine or Differentiable Neural Computer (Graves et al., 2014; 2016) it maintains a fixed size memory and can learn to perform location and content-based read and write operations. However, unlike those models it has a simple parallel architecture in which several memory locations can be updated simultaneously. The EntNet sets a new state-of-the-art on the bAbI tasks, and is the first method to solve all the tasks in the 10k training examples setting.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsSoftmax · Sigmoid Activation · Tanh Activation · Neural Turing Machine · Recurrent Entity Network · Location-based Attention · Content-based Attention · Long Short-Term Memory
