Neural Turing Machines
Alex Graves, Greg Wayne, Ivo Danihelka

TL;DR
Neural Turing Machines integrate neural networks with external memory, enabling the learning of simple algorithms through differentiable operations, representing a significant step towards neural systems with algorithmic capabilities.
Contribution
This paper introduces Neural Turing Machines, a novel neural architecture that couples neural networks with external memory for differentiable algorithm learning.
Findings
Successfully learned copying, sorting, and associative recall algorithms
Demonstrated end-to-end differentiable training of neural memory systems
Showed potential for neural systems to perform algorithmic tasks
Abstract
We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
MethodsSigmoid Activation · Tanh Activation · Neural Turing Machine · Long Short-Term Memory · Content-based Attention
