Why the Agent Made that Decision: Contrastive Explanation Learning for Reinforcement Learning

Rui Zuo; Simon Khan; Zifan Wang; Garrett Ethan Katz; Qinru Qiu

arXiv:2411.16120·cs.AI·September 23, 2025

Why the Agent Made that Decision: Contrastive Explanation Learning for Reinforcement Learning

Rui Zuo, Simon Khan, Zifan Wang, Garrett Ethan Katz, Qinru Qiu

PDF

Open Access

TL;DR

This paper introduces VisionMask, a contrastive learning framework that generates explanations for reinforcement learning decisions by contrasting chosen actions with alternatives, improving interpretability and trust in RL agents.

Contribution

The paper presents a novel contrastive learning approach for explaining RL actions, addressing the gap in interpretability by explicitly contrasting actions in a self-supervised manner.

Findings

01

VisionMask improves human understanding of RL decisions

02

The method maintains high fidelity and robustness

03

It enables effective counterfactual analysis

Abstract

Reinforcement learning (RL) has demonstrated remarkable success in solving complex decision-making problems, yet its adoption in critical domains is hindered by the lack of interpretability in its decision-making processes. Existing explainable AI (xAI) approaches often fail to provide meaningful explanations for RL agents, particularly because they overlook the contrastive nature of human reasoning--answering "why this action instead of that one?". To address this gap, we propose a novel framework of contrastive learning to explain RL selected actions, named $VisionMask$ . VisionMask is trained to generate explanations by explicitly contrasting the agent's chosen action with alternative actions in a given state using a self-supervised manner. We demonstrate the efficacy of our method through experiments across diverse RL environments, evaluating it in terms of faithfulness,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI