Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems
Yashesh Dhebar, Kalyanmoy Deb, Subramanya Nageshrao, Ling Zhu and, Dimitar Filev

TL;DR
This paper introduces a method to derive simple, interpretable nonlinear decision-tree policies from complex black-box deep reinforcement learning agents, maintaining high performance in discrete control tasks.
Contribution
It proposes a novel evolutionary bilevel optimization approach to induce hierarchical nonlinear decision trees that approximate and explain black-box policies.
Findings
Achieved interpretable policies with 1-4 nonlinear terms per rule
Matched the performance of black-box DRL agents in various control tasks
Simplified complex policies into understandable decision rules
Abstract
Black-box AI induction methods such as deep reinforcement learning (DRL) are increasingly being used to find optimal policies for a given control task. Although policies represented using a black-box AI are capable of efficiently executing the underlying control task and achieving optimal closed-loop performance, the developed control rules are often complex and neither interpretable nor explainable. In this paper, we use a recently proposed nonlinear decision-tree (NLDT) approach to find a hierarchical set of control rules in an attempt to maximize the open-loop performance for approximating and explaining the pre-trained black-box DRL (oracle) agent using the labelled state-action dataset. Recent advances in nonlinear optimization approaches using evolutionary computation facilitates finding a hierarchical set of nonlinear control rules as a function of state variables using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Smart Grid Energy Management
