State-Regularized Recurrent Neural Networks to Extract Automata and   Explain Predictions

Cheng Wang; Carolin Lawrence; Mathias Niepert

arXiv:2212.05178·cs.LG·December 13, 2022

State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions

Cheng Wang, Carolin Lawrence, Mathias Niepert

PDF

Open Access 1 Datasets

TL;DR

This paper introduces state-regularized RNNs that improve interpretability, automata extraction, and long-term sequence modeling by incorporating a stochastic state transition mechanism, addressing key limitations of traditional RNNs.

Contribution

The paper proposes a novel state-regularization technique for RNNs that enhances interpretability and automata extraction while improving performance on long-term sequence tasks.

Findings

01

Easier extraction of finite state automata from RNNs.

02

RNNs operate more like automata with external memory.

03

Improved interpretability and explainability of RNN predictions.

Abstract

Recurrent neural networks are a widely used class of neural architectures. They have, however, two shortcomings. First, they are often treated as black-box models and as such it is difficult to understand what exactly they learn as well as how they arrive at a particular prediction. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state transition mechanism between cell applications. This mechanism, which we term state-regularization, makes RNNs transition between a finite set of learnable states. We evaluate state-regularized RNNs on (1) regular languages for the purpose of automata extraction; (2) non-regular languages such as balanced parentheses and palindromes where external memory is required; and (3) real-word…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Machine Learning and Algorithms · Advanced Memory and Neural Computing