Explainable Reinforcement Learning via a Causal World Model

Zhongwei Yu; Jingqing Ruan; Dengpeng Xing

arXiv:2305.02749·cs.LG·January 19, 2024·1 cites

Explainable Reinforcement Learning via a Causal World Model

Zhongwei Yu, Jingqing Ruan, Dengpeng Xing

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal world model for reinforcement learning that enhances explainability by capturing long-term action effects through causal chains, while maintaining high accuracy for effective model-based learning.

Contribution

It presents a novel causal world model that explains long-term effects of actions in RL without prior causal knowledge, improving interpretability and accuracy.

Findings

01

Model accurately captures causal influence of actions.

02

Enhances explainability without sacrificing performance.

03

Applicable to model-based reinforcement learning.

Abstract

Generating explanations for reinforcement learning (RL) is challenging as actions may produce long-term effects on the future. In this paper, we develop a novel framework for explainable RL by learning a causal world model without prior knowledge of the causal structure of the environment. The model captures the influence of actions, allowing us to interpret the long-term effects of actions through causal chains, which present how actions influence environmental variables and finally lead to rewards. Different from most explanatory models which suffer from low accuracy, our model remains accurate while improving explainability, making it applicable in model-based learning. As a result, we demonstrate that our causal model can serve as the bridge between explainability and learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

easeonway/explainable-causal-reinforcement-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques