Extracting Weighted Finite Automata from Recurrent Neural Networks for   Natural Languages

Zeming Wei; Xiyue Zhang; Meng Sun

arXiv:2206.14621·cs.CL·September 28, 2022

Extracting Weighted Finite Automata from Recurrent Neural Networks for Natural Languages

Zeming Wei, Xiyue Zhang, Meng Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces a scalable method for extracting weighted finite automata from RNNs in natural language processing, improving extraction precision by addressing transition sparsity and enhancing context-awareness.

Contribution

It proposes a novel transition rule extraction approach with data augmentation tactics, significantly improving the interpretability of RNNs for natural language tasks.

Findings

01

Outperforms existing methods in extraction precision

02

Effective in capturing dynamic behaviors of RNNs

03

Scalable to natural language processing models

Abstract

Recurrent Neural Networks (RNNs) have achieved tremendous success in sequential data processing. However, it is quite challenging to interpret and verify RNNs' behaviors directly. To this end, many efforts have been made to extract finite automata from RNNs. Existing approaches such as exact learning are effective in extracting finite-state models to characterize the state dynamics of RNNs for formal languages, but are limited in the scalability to process natural languages. Compositional approaches that are scablable to natural languages fall short in extraction precision. In this paper, we identify the transition sparsity problem that heavily impacts the extraction precision. To address this problem, we propose a transition rule extraction approach, which is scalable to natural language processing models and effective in improving extraction precision. Specifically, we propose an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

weizeming/extract_wfa_from_rnn_for_nl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Machine Learning and Algorithms · Machine Learning in Materials Science