Learning Context-Free Languages with Nondeterministic Stack RNNs

Brian DuSell; David Chiang

arXiv:2010.04674·cs.CL·December 1, 2022

TL;DR

This paper introduces a differentiable nondeterministic stack data structure and pairs it with an RNN controller. The stack simultaneously and tractably encodes an exponential number of stack configurations, and the resulting model compares favorably against existing stack RNNs on formal language tasks.

Contribution

The paper proposes the Nondeterministic Stack RNN: an RNN controller combined with a differentiable stack based on Lang's algorithm for simulating nondeterministic pushdown automata.

Findings

1. Converges more reliably to algorithmic behavior on deterministic tasks than existing stack RNNs.

2. Achieves lower cross-entropy on inherently nondeterministic tasks.

3. Evaluated against existing stack RNNs on various formal languages.

Abstract

We present a differentiable stack data structure that simultaneously and tractably encodes an exponential number of stack configurations, based on Lang's algorithm for simulating nondeterministic pushdown automata. We call the combination of this data structure with a recurrent neural network (RNN) controller a Nondeterministic Stack RNN. We compare our model against existing stack RNNs on various formal languages, demonstrating that our model converges more reliably to algorithmic behavior on deterministic tasks, and achieves lower cross-entropy on inherently nondeterministic tasks.
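
To make the "exponential number of stack configurations" concrete, here is a minimal sketch of what naive enumeration costs. It is not Lang's algorithm and not the paper's data structure; the toy automaton (every step may push either `A` or `B`) and the names `step` and `count_configurations` are assumptions chosen purely for illustration.

```python
from typing import Set, Tuple

# A stack is modeled as an immutable tuple of symbols, bottom first.
Stack = Tuple[str, ...]


def step(stacks: Set[Stack]) -> Set[Stack]:
    """Apply one nondeterministic move of a hypothetical toy automaton.

    Every current stack may be extended by pushing 'A' or by pushing 'B',
    so each configuration branches into two.
    """
    return {stack + (symbol,) for stack in stacks for symbol in ("A", "B")}


def count_configurations(n_steps: int) -> int:
    """Count the distinct stacks reachable after n_steps moves."""
    stacks: Set[Stack] = {()}  # start from the empty stack
    for _ in range(n_steps):
        stacks = step(stacks)
    return len(stacks)


if __name__ == "__main__":
    for n in (1, 5, 10, 20):
        print(n, count_configurations(n))
```

Under these assumptions the set of explicit stacks doubles at every step, so tracking each one separately quickly becomes impractical. That growth is the motivation for a representation that covers all configurations at once without listing them individually, which is what the abstract describes.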

Code & Models

Repositories

bdusell/nondeterministic-stack-rnn (PyTorch, official)
