State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
Cheng Wang, Carolin Lawrence, Mathias Niepert

TL;DR
This paper introduces state-regularized RNNs that improve interpretability, automata extraction, and long-term sequence modeling by incorporating a stochastic state transition mechanism, addressing key limitations of traditional RNNs.
Contribution
The paper proposes a novel state-regularization technique for RNNs that enhances interpretability and automata extraction while improving performance on long-term sequence tasks.
Findings
Easier extraction of finite state automata from RNNs.
RNNs operate more like automata with external memory.
Improved interpretability and explainability of RNN predictions.
Abstract
Recurrent neural networks are a widely used class of neural architectures. They have, however, two shortcomings. First, they are often treated as black-box models and as such it is difficult to understand what exactly they learn as well as how they arrive at a particular prediction. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) non-regular languages such as balanced parentheses and palindromes where external memory is required; and (3) real-word…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Machine Learning and Algorithms · Advanced Memory and Neural Computing
