Relational recurrent neural networks

Adam Santoro; Ryan Faulkner; David Raposo; Jack Rae; Mike Chrzanowski; Theophane Weber; Daan Wierstra; Oriol Vinyals; Razvan Pascanu; Timothy Lillicrap

arXiv:1806.01822·cs.LG·June 29, 2018·27 cites

TL;DR

This paper introduces the Relational Memory Core, a new memory module for neural networks that enhances relational reasoning capabilities, leading to significant improvements in various sequential tasks and language modeling benchmarks.

Contribution

The paper proposes the Relational Memory Core, a novel memory module using multi-head attention to improve relational reasoning in neural networks.

Findings

01. Large gains in reinforcement learning domains such as Mini PacMan

02. State-of-the-art results on the WikiText-103, Project Gutenberg, and GigaWord datasets

03. Improved relational reasoning in sequential tasks

Abstract

Memory-based neural networks model temporal data by leveraging an ability to remember information for long periods. It is unclear, however, whether they also have an ability to perform complex relational reasoning with the information they remember. Here, we first confirm our intuitions that standard memory architectures may struggle at tasks that heavily involve an understanding of the ways in which entities are connected -- i.e., tasks involving relational reasoning. We then improve upon these deficits by using a new memory module -- a Relational Memory Core (RMC) -- which employs multi-head dot product attention to allow memories to interact. Finally, we test the RMC on a suite of tasks that may profit from more capable relational reasoning across sequential information, and show large gains in RL domains (e.g. Mini PacMan), program evaluation, and language modeling,…
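To make "multi-head dot product attention to allow memories to interact" concrete, here is a minimal pure-Python sketch in which every memory slot attends over all slots. It is an illustration under stated assumptions, not the authors' implementation: the function names (`attend`, `matmul`, `random_matrix`), the random projection weights, and the sizes are all hypothetical, and the sketch leaves out everything else the full module may contain, such as how new inputs are written into memory.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def attend(memory, w_q, w_k, w_v, num_heads):
    """One multi-head dot-product attention pass over the memory rows.

    Assumption: queries, keys and values are all linear projections of the
    same memory matrix, so each slot can read from every other slot.
    """
    q, k, v = matmul(memory, w_q), matmul(memory, w_k), matmul(memory, w_v)
    d_head = len(q[0]) // num_heads
    outputs = [[] for _ in memory]
    all_weights = []
    for h in range(num_heads):
        lo, hi = h * d_head, (h + 1) * d_head
        qh = [row[lo:hi] for row in q]
        kh = [row[lo:hi] for row in k]
        vh = [row[lo:hi] for row in v]
        # scores[i][j]: how strongly slot i attends to slot j in this head
        scores = matmul(qh, [list(col) for col in zip(*kh)])
        weights = [softmax([s / math.sqrt(d_head) for s in row]) for row in scores]
        mixed = matmul(weights, vh)
        # Concatenate the per-head results back into one row per slot
        for i, row in enumerate(mixed):
            outputs[i].extend(row)
        all_weights.append(weights)
    return outputs, all_weights

def random_matrix(rows, cols):
    """Hypothetical stand-in for learned parameters or stored memories."""
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

random.seed(0)
num_slots, d_model, num_heads = 4, 8, 2   # illustrative sizes only
memory = random_matrix(num_slots, d_model)
w_q, w_k, w_v = (random_matrix(d_model, d_model) for _ in range(3))
updated, weights = attend(memory, w_q, w_k, w_v, num_heads)
```

Each row of `updated` is a mixture of value vectors from all slots, weighted by query-key similarity, which is the sense in which the memories "interact". Using several heads lets a slot mix information from the others in more than one way within a single pass.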

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning