Local Explanations for Reinforcement Learning

Ronny Luss; Amit Dhurandhar; Miao Liu

arXiv:2202.03597·cs.LG·November 29, 2022

Local Explanations for Reinforcement Learning

Ronny Luss, Amit Dhurandhar, Miao Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel method for explaining deep reinforcement learning policies by identifying important states based on locality governed by policy dynamics, improving interpretability without relying on state space topology.

Contribution

We propose a new approach to explain RL policies through meta-states formed by locality, with theoretical convergence guarantees and efficient selection of important states.

Findings

01

Meta-states improve policy interpretability

02

Algorithm converges and is computationally efficient

03

User study confirms better understanding of policies

Abstract

Many works in explainable AI have focused on explaining black-box classification models. Explaining deep reinforcement learning (RL) policies in a manner that could be understood by domain users has received much less attention. In this paper, we propose a novel perspective to understanding RL policies based on identifying important states from automatically learned meta-states. The key conceptual difference between our approach and many previous ones is that we form meta-states based on locality governed by the expert policy dynamics rather than based on similarity of actions, and that we do not assume any particular knowledge of the underlying topology of the state space. Theoretically, we show that our algorithm to find meta-states converges and the objective that selects important states from each meta-state is submodular leading to efficient high quality greedy selection.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Local Explanations for Reinforcement Learning· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Stock Market Forecasting Methods