A Survey of Explainable Reinforcement Learning

Stephanie Milani; Nicholay Topin; Manuela Veloso; Fei Fang

arXiv:2202.08434·cs.LG·February 18, 2022·25 cites

A Survey of Explainable Reinforcement Learning

Stephanie Milani, Nicholay Topin, Manuela Veloso, Fei Fang

PDF

Open Access

TL;DR

This survey reviews explainable reinforcement learning (XRL), proposing a taxonomy, summarizing current techniques, identifying gaps, and outlining future research directions in the field.

Contribution

It introduces a novel taxonomy for XRL, organizes existing techniques accordingly, and highlights research gaps and future directions.

Findings

01

Proposes a taxonomy prioritizing RL settings

02

Summarizes current XRL techniques

03

Identifies gaps and future research directions

Abstract

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of learning agents in sequential decision-making settings. In this survey, we propose a novel taxonomy for organizing the XRL literature that prioritizes the RL setting. We overview techniques according to this taxonomy. We point out gaps in the literature, which we use to motivate and outline a roadmap for future work.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Reinforcement Learning in Robotics