Neural Turing Machines

Alex Graves; Greg Wayne; Ivo Danihelka

arXiv:1410.5401 · cs.NE · December 11, 2014 · 110 cites

TL;DR

Neural Turing Machines couple a neural network to an external memory that it reads from and writes to through differentiable attention. Because the whole system can be trained with gradient descent, it can learn simple algorithms such as copying, sorting, and associative recall from input and output examples.

Contribution

This paper introduces the Neural Turing Machine, an architecture that couples a neural network to external memory through attentional read and write operations. The combined system is differentiable end-to-end, so it can be trained with gradient descent.

Findings

01. Successfully learned copying, sorting, and associative recall algorithms

02. Demonstrated end-to-end differentiable training of neural memory systems

03. Showed potential for neural systems to perform algorithmic tasks
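To make "learning to copy from input and output examples" concrete, here is a minimal sketch of how one training example for a copy task could be built. The layout is an assumption for illustration, not the paper's exact data format: a sequence of random binary vectors, an extra delimiter channel, and blank input rows while the model is expected to reproduce the sequence. The name `make_copy_example` and its parameters are hypothetical.

```python
import random


def make_copy_example(seq_len, width=8, rng=None):
    """Build one (inputs, targets) pair for a copy task.

    Assumed layout (illustrative only):
      - seq_len rows of random bits, delimiter channel set to 0
      - one delimiter row: all zeros, delimiter channel set to 1
      - seq_len blank rows during which the model must emit the copy
    """
    rng = rng or random.Random()
    sequence = [[rng.randint(0, 1) for _ in range(width)] for _ in range(seq_len)]

    # Presentation phase: append a 0 in the delimiter channel of every vector.
    inputs = [vec + [0] for vec in sequence]
    # Delimiter row signals the end of the sequence.
    inputs.append([0] * width + [1])
    # Recall phase: the model receives no further input.
    inputs.extend([[0] * (width + 1) for _ in range(seq_len)])

    # Targets are blank until the delimiter has been seen, then the sequence itself.
    targets = [[0] * width for _ in range(seq_len + 1)]
    targets.extend(list(vec) for vec in sequence)
    return inputs, targets


inputs, targets = make_copy_example(seq_len=3, rng=random.Random(0))
```

A model that has inferred the copy algorithm, rather than memorized particular sequences, should produce the right targets for any `seq_len`, which is why examples of varying length are useful for training and evaluation.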

Abstract

We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.
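The "attentional processes" in the abstract can be illustrated with a content-based read: the network emits a key, every memory row is scored by similarity to that key, and the read result is a weighted average of all rows. The sketch below is a simplified illustration in plain Python, not the authors' implementation. It assumes cosine similarity sharpened by a scalar `beta` and normalized with a softmax; the function names are hypothetical, and the rest of the addressing mechanism is omitted.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def content_weights(memory, key, beta):
    """Softmax over beta-scaled similarities between the key and each memory row."""
    scores = [beta * cosine_similarity(row, key) for row in memory]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def read(memory, weights):
    """Weighted average of memory rows: a 'blurry' read over all locations."""
    width = len(memory[0])
    return [sum(w * row[j] for w, row in zip(weights, memory)) for j in range(width)]


# Three memory rows of width 3; the key is closest to the first row.
memory = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
key = [0.9, 0.1, 0.0]
weights = content_weights(memory, key, beta=10.0)
read_vector = read(memory, weights)
```

Because every step is a smooth function of the key, `beta`, and the memory contents, gradients can flow through the read, which is what makes end-to-end training with gradient descent possible. A larger `beta` concentrates the weights on the best-matching row; a smaller one spreads them across rows.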

Taxonomy

Topics: Neural Networks and Applications · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing

Methods: Sigmoid Activation · Tanh Activation · Neural Turing Machine · Long Short-Term Memory · Content-based Attention