Re-understanding Finite-State Representations of Recurrent Policy   Networks

Mohamad H. Danesh; Anurag Koul; Alan Fern; Saeed Khorram

arXiv:2006.03745·cs.LG·July 13, 2021·5 cites

Re-understanding Finite-State Representations of Recurrent Policy Networks

Mohamad H. Danesh, Anurag Koul, Alan Fern, Saeed Khorram

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a new method for analyzing recurrent neural network policies by using unminimized finite-state machines and interpretability tools, revealing novel insights into their decision-making processes.

Contribution

It introduces an analysis approach that preserves key decision points in FSMs and an attention tool for understanding observation roles, improving interpretability of recurrent policies.

Findings

01

Revealed new insights into policy behaviors in Atari games.

02

Demonstrated the effectiveness of unminimized FSM analysis.

03

Provided tools for deeper understanding of observation influence.

Abstract

We introduce an approach for understanding control policies represented as recurrent neural networks. Recent work has approached this problem by transforming such recurrent policy networks into finite-state machines (FSM) and then analyzing the equivalent minimized FSM. While this led to interesting insights, the minimization process can obscure a deeper understanding of a machine's operation by merging states that are semantically distinct. To address this issue, we introduce an analysis approach that starts with an unminimized FSM and applies more-interpretable reductions that preserve the key decision points of the policy. We also contribute an attention tool to attain a deeper understanding of the role of observations in the decisions. Our case studies on 7 Atari games and 3 control benchmarks demonstrate that the approach can reveal insights that have not been previously noticed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

modanesh/Differential_IG
pytorchOfficial

Videos

Re-understanding Finite-State Representations of Recurrent Policy Networks· slideslive

Taxonomy

TopicsSocial Policy and Reform Studies · Complex Systems and Decision Making