Causal Explanation for Reinforcement Learning: Quantifying State and   Temporal Importance

Xiaoxiao Wang; Fanyu Meng; Xin Liu; Zhaodan Kong; Xin Chen

arXiv:2210.13507·cs.AI·July 4, 2023

Causal Explanation for Reinforcement Learning: Quantifying State and Temporal Importance

Xiaoxiao Wang, Fanyu Meng, Xin Liu, Zhaodan Kong, Xin Chen

PDF

Open Access

TL;DR

This paper introduces a causal explanation mechanism for reinforcement learning that quantifies the importance of states and their temporal influence on actions, providing more human-aligned explanations than traditional associational methods.

Contribution

It develops a novel causal explanation approach for RL policies and demonstrates its advantages over existing associational methods through multiple simulation studies.

Findings

01

Causal importance measures outperform associational methods in explaining RL policies.

02

The mechanism provides clearer, more human-aligned explanations of state-action relationships.

03

Simulation results include applications in crop irrigation, Blackjack, collision avoidance, and lunar lander.

Abstract

Explainability plays an increasingly important role in machine learning. Furthermore, humans view the world through a causal lens and thus prefer causal explanations over associational ones. Therefore, in this paper, we develop a causal explanation mechanism that quantifies the causal importance of states on actions and such importance over time. We also demonstrate the advantages of our mechanism over state-of-the-art associational methods in terms of RL policy explanation through a series of simulation studies, including crop irrigation, Blackjack, collision avoidance, and lunar lander.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics