Reinforcement Learning Neural Turing Machines - Revised
Wojciech Zaremba, Ilya Sutskever

TL;DR
This paper explores the use of reinforcement learning to train neural networks that interact with discrete external interfaces like memory and tapes, demonstrating Turing completeness for solving algorithmic tasks.
Contribution
It introduces a method for training neural networks to interact with discrete external interfaces using reinforcement learning, extending the capabilities of Neural Turing Machines.
Findings
Neural networks can learn to interact with discrete interfaces.
The model achieves Turing completeness with the proposed interfaces.
Reinforcement learning effectively trains models for algorithmic tasks.
Abstract
The Neural Turing Machine (NTM) is more expressive than all previously considered models because of its external memory. It can be viewed as a broader effort to use abstract external Interfaces and to learn a parametric model that interacts with them. The capabilities of a model can be extended by providing it with proper Interfaces that interact with the world. These external Interfaces include memory, a database, a search engine, or a piece of software such as a theorem verifier. Some of these Interfaces are provided by the developers of the model. However, many important existing Interfaces, such as databases and search engines, are discrete. We examine feasibility of learning models to interact with discrete Interfaces. We investigate the following discrete Interfaces: a memory Tape, an input Tape, and an output Tape. We use a Reinforcement Learning algorithm to train a neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Computability, Logic, AI Algorithms
MethodsSigmoid Activation · Tanh Activation · Neural Turing Machine · Content-based Attention · Long Short-Term Memory
