TL;DR
The paper introduces the reservoir stack machine, a neural network model that efficiently recognizes deterministic context-free languages by training only the output layer, achieving zero error with minimal training data and time.
Contribution
It presents a novel reservoir stack machine model that provably recognizes all deterministic context-free languages and simplifies training by focusing only on the output layer.
Findings
Achieves zero error on benchmark tasks
Requires only a few seconds of training
Handles sequences longer than training data
Abstract
Memory-augmented neural networks equip a recurrent neural network with an explicit memory to support tasks that require information storage without interference over long times. A key motivation for such research is to perform classic computation tasks, such as parsing. However, memory-augmented neural networks are notoriously hard to train, requiring many backpropagation epochs and a lot of data. In this paper, we introduce the reservoir stack machine, a model which can provably recognize all deterministic context-free languages and circumvents the training problem by training only the output layer of a recurrent net and employing auxiliary information during training about the desired interaction with a stack. In our experiments, we validate the reservoir stack machine against deep and shallow networks from the literature on three benchmark tasks for Neural Turing machines and six…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
