Synthesising Reinforcement Learning Policies through Set-Valued   Inductive Rule Learning

Youri Coppens; Denis Steckelmacher; Catholijn M. Jonker; Ann Now\'e

arXiv:2106.06009·cs.AI·June 14, 2021

Synthesising Reinforcement Learning Policies through Set-Valued Inductive Rule Learning

Youri Coppens, Denis Steckelmacher, Catholijn M. Jonker, Ann Now\'e

PDF

1 Repo

TL;DR

This paper presents a method to convert complex reinforcement learning policies into simple, interpretable rule-based systems by leveraging meta-information about action quality, improving transparency and understanding.

Contribution

It extends the CN2 rule mining algorithm to incorporate meta-information from RL, enabling the distillation of policies into fewer, more interpretable rules with a refinement process for better performance.

Findings

01

Successfully applied on Mario AI benchmark

02

Produced concise rules capturing learned policies

03

Enhanced interpretability of RL policies

Abstract

Today's advanced Reinforcement Learning algorithms produce black-box policies, that are often difficult to interpret and trust for a person. We introduce a policy distilling algorithm, building on the CN2 rule mining algorithm, that distills the policy into a rule-based decision system. At the core of our approach is the fact that an RL process does not just learn a policy, a mapping from states to actions, but also produces extra meta-information, such as action values indicating the quality of alternative actions. This meta-information can indicate whether more than one action is near-optimal for a certain state. We extend CN2 to make it able to leverage knowledge about equally-good actions to distill the policy into fewer rules, increasing its interpretability by a person. Then, to ensure that the rules explain a valid, non-degenerate policy, we introduce a refinement algorithm that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.ai.vub.ac.be/yocoppen/svcn2
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.