Reservoir Stack Machines

Benjamin Paaßen; Alexander Schulz; Barbara Hammer

arXiv:2105.01616·cs.NE·July 27, 2021

TL;DR

The paper introduces the reservoir stack machine, a recurrent network with an explicit stack that can recognize all deterministic context-free languages. Only the output layer is trained, and the paper reports zero error on its benchmark tasks with little training data and a few seconds of training.

Contribution

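The paper presents the reservoir stack machine, a model that can provably recognize all deterministic context-free languages. It sidesteps the usual training difficulty of memory-augmented networks by training only the output layer of a recurrent network, using auxiliary information about the desired stack interaction during training.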

Findings

01 Achieves zero error on benchmark tasks

02 Requires only a few seconds of training

03 Handles sequences longer than those seen in training

Abstract

Memory-augmented neural networks equip a recurrent neural network with an explicit memory to support tasks that require information storage without interference over long times. A key motivation for such research is to perform classic computation tasks, such as parsing. However, memory-augmented neural networks are notoriously hard to train, requiring many backpropagation epochs and a lot of data. In this paper, we introduce the reservoir stack machine, a model which can provably recognize all deterministic context-free languages and circumvents the training problem by training only the output layer of a recurrent net and employing auxiliary information during training about the desired interaction with a stack. In our experiments, we validate the reservoir stack machine against deep and shallow networks from the literature on three benchmark tasks for Neural Turing machines and six…
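The phrase "auxiliary information during training about the desired interaction with a stack" can be made concrete with a small sketch. The code below is not taken from the paper or its repository. It assumes, for illustration only, that the auxiliary information takes the form of one stack action per input token, and it uses the Dyck language of balanced parentheses as an example of a deterministic context-free language. The names `stack_actions`, `PUSH`, and `POP` are hypothetical.

```python
from typing import List, Tuple

# Hypothetical action labels; the paper's actual action set may differ.
PUSH, POP = "push", "pop"


def stack_actions(word: str) -> Tuple[bool, List[str]]:
    """Recognize balanced parentheses with an explicit stack.

    Returns (accepted, actions), where actions[t] is the stack operation
    performed while reading word[t]. A sequence like `actions` is the kind
    of per-step target an output layer could be trained to predict.
    """
    stack: List[str] = []
    actions: List[str] = []
    for symbol in word:
        if symbol == "(":
            stack.append(symbol)
            actions.append(PUSH)
        elif symbol == ")":
            if not stack:
                # Closing bracket without a matching opener: reject early.
                return False, actions
            stack.pop()
            actions.append(POP)
        else:
            raise ValueError(f"unexpected symbol: {symbol!r}")
    # Accept only if every opener was closed.
    return not stack, actions


if __name__ == "__main__":
    print(stack_actions("(())"))
    print(stack_actions("(()"))
```

In this sketch the recognizer itself is hand-written. In a setting like the one the abstract describes, the action sequence would instead serve as a supervised target, so that only a readout from the recurrent state to the stack actions needs to be fitted.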

Code & Models

Repositories

https://gitlab.com/bpaassen/reservoir_stack_machines
PyTorch · Official
