Toward Interpretable Deep Reinforcement Learning with Linear Model   U-Trees

Guiliang Liu; Oliver Schulte; Wang Zhu; Qingcan Li

arXiv:1807.05887·cs.LG·July 17, 2018

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

Guiliang Liu, Oliver Schulte, Wang Zhu, Qingcan Li

PDF

Open Access

TL;DR

This paper introduces Linear Model U-trees (LMUTs), a novel method to interpret deep reinforcement learning Q functions by mimicking neural network predictions with transparent, rule-extracting tree models.

Contribution

It develops the first mimic learning framework for Q functions in DRL using LMUTs, enabling interpretability and analysis of neural network policies.

Findings

01

LMUTs outperform baseline methods in mimicking Q functions

02

LMUTs facilitate understanding of feature influence and rule extraction

03

Empirical results demonstrate effective approximation of neural network predictions

Abstract

Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function, to estimate the expected cumulative reward following a state-action pair. The Q function neural network contains a lot of implicit knowledge about the RL problems, but often remains unexamined and uninterpreted. To our knowledge, this work develops the first mimic learning framework for Q functions in DRL. We introduce Linear Model U-trees (LMUTs) to approximate neural network predictions. An LMUT is learned using a novel on-line algorithm that is well-suited for an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment. Empirical evaluation shows that an LMUT mimics a Q function substantially better than five baseline methods. The transparent tree…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Reinforcement Learning in Robotics · Machine Learning and Algorithms