The Surprising Computational Power of Nondeterministic Stack RNNs

Brian DuSell; David Chiang

arXiv:2210.01343·cs.CL·March 14, 2023

The Surprising Computational Power of Nondeterministic Stack RNNs

Brian DuSell, David Chiang

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper shows that nondeterministic stack RNNs can recognize a broader class of languages, including some non-context-free ones, and introduces a vector-based stack model that improves language modeling performance.

Contribution

The paper reveals unexpected capabilities of nondeterministic stack RNNs and proposes a vector-based stack model to enhance their information capacity.

Findings

01

Recognize non-context-free languages

02

Handle larger alphabet sizes effectively

03

Improve perplexity on Penn Treebank benchmark

Abstract

Traditional recurrent neural networks (RNNs) have a fixed, finite number of memory cells. In theory (assuming bounded range and precision), this limits their formal language recognition power to regular languages, and in practice, RNNs have been shown to be unable to learn many context-free languages (CFLs). In order to expand the class of languages RNNs recognize, prior work has augmented RNNs with a nondeterministic stack data structure, putting them on par with pushdown automata and increasing their language recognition power to CFLs. Nondeterminism is needed for recognizing all CFLs (not just deterministic CFLs), but in this paper, we show that nondeterminism and the neural controller interact to produce two more unexpected abilities. First, the nondeterministic stack RNN can recognize not only CFLs, but also many non-context-free languages. Second, it can recognize languages with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

The Surprising Computational Power of Nondeterministic Stack RNNs· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Ferroelectric and Negative Capacitance Devices