Why the Agent Made that Decision: Contrastive Explanation Learning for Reinforcement Learning
Rui Zuo, Simon Khan, Zifan Wang, Garrett Ethan Katz, Qinru Qiu

TL;DR
This paper introduces VisionMask, a contrastive learning framework that generates explanations for reinforcement learning decisions by contrasting chosen actions with alternatives, improving interpretability and trust in RL agents.
Contribution
The paper presents a novel contrastive learning approach for explaining RL actions, addressing the gap in interpretability by explicitly contrasting actions in a self-supervised manner.
Findings
VisionMask improves human understanding of RL decisions
The method maintains high fidelity and robustness
It enables effective counterfactual analysis
Abstract
Reinforcement learning (RL) has demonstrated remarkable success in solving complex decision-making problems, yet its adoption in critical domains is hindered by the lack of interpretability in its decision-making processes. Existing explainable AI (xAI) approaches often fail to provide meaningful explanations for RL agents, particularly because they overlook the contrastive nature of human reasoning--answering "why this action instead of that one?". To address this gap, we propose a novel framework of contrastive learning to explain RL selected actions, named . VisionMask is trained to generate explanations by explicitly contrasting the agent's chosen action with alternative actions in a given state using a self-supervised manner. We demonstrate the efficacy of our method through experiments across diverse RL environments, evaluating it in terms of faithfulness,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
