Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear   Decision Trees for Discrete Action Systems

Yashesh Dhebar; Kalyanmoy Deb; Subramanya Nageshrao; Ling Zhu and; Dimitar Filev

arXiv:2009.09521·cs.LG·March 16, 2023

Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems

Yashesh Dhebar, Kalyanmoy Deb, Subramanya Nageshrao, Ling Zhu and, Dimitar Filev

PDF

Open Access

TL;DR

This paper introduces a method to derive simple, interpretable nonlinear decision-tree policies from complex black-box deep reinforcement learning agents, maintaining high performance in discrete control tasks.

Contribution

It proposes a novel evolutionary bilevel optimization approach to induce hierarchical nonlinear decision trees that approximate and explain black-box policies.

Findings

01

Achieved interpretable policies with 1-4 nonlinear terms per rule

02

Matched the performance of black-box DRL agents in various control tasks

03

Simplified complex policies into understandable decision rules

Abstract

Black-box AI induction methods such as deep reinforcement learning (DRL) are increasingly being used to find optimal policies for a given control task. Although policies represented using a black-box AI are capable of efficiently executing the underlying control task and achieving optimal closed-loop performance, the developed control rules are often complex and neither interpretable nor explainable. In this paper, we use a recently proposed nonlinear decision-tree (NLDT) approach to find a hierarchical set of control rules in an attempt to maximize the open-loop performance for approximating and explaining the pre-trained black-box DRL (oracle) agent using the labelled state-action dataset. Recent advances in nonlinear optimization approaches using evolutionary computation facilitates finding a hierarchical set of nonlinear control rules as a function of state variables using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Smart Grid Energy Management