Interpretable Local Tree Surrogate Policies

John Mern; Sidhart Krishnan; Anil Yildiz; Kyle Hatch; Mykel J.; Kochenderfer

arXiv:2109.08180·cs.LG·September 20, 2021

Interpretable Local Tree Surrogate Policies

John Mern, Sidhart Krishnan, Anil Yildiz, Kyle Hatch, Mykel J., Kochenderfer

PDF

Open Access

TL;DR

This paper introduces a method to create interpretable policy trees as surrogates for complex neural network policies, enhancing transparency and predictability in high-dimensional decision-making tasks.

Contribution

The paper presents a novel approach to build human-interpretable policy trees that approximate neural network policies, enabling better understanding and trust.

Findings

01

Policy trees accurately mimic neural network policies.

02

The approach improves interpretability without significant loss of performance.

03

Demonstrated effectiveness on simulated tasks.

Abstract

High-dimensional policies, such as those represented by neural networks, cannot be reasonably interpreted by humans. This lack of interpretability reduces the trust users have in policy behavior, limiting their use to low-impact tasks such as video games. Unfortunately, many methods rely on neural network representations for effective learning. In this work, we propose a method to build predictable policy trees as surrogates for policies such as neural networks. The policy trees are easily human interpretable and provide quantitative predictions of future behavior. We demonstrate the performance of this approach on several simulated tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics