ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement   Learning

Jasmina Gajcin; Ivana Dusparic

arXiv:2203.11211·cs.LG·March 23, 2022·1 cites

ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning

Jasmina Gajcin, Ivana Dusparic

PDF

Open Access 1 Repo

TL;DR

ReCCoVER is a method for detecting when reinforcement learning agents rely on spurious correlations, improving transparency and guiding feature selection to prevent causal confusion in high-risk applications.

Contribution

The paper introduces ReCCoVER, an algorithm that detects causal confusion in DRL agents by testing their policies in alternative environments, enhancing explainability and safety.

Findings

01

ReCCoVER successfully identifies states with causal confusion.

02

The method provides actionable feature recommendations.

03

Demonstrated effectiveness in taxi and grid world environments.

Abstract

Despite notable results in various fields over the recent years, deep reinforcement learning (DRL) algorithms lack transparency, affecting user trust and hindering their deployment to high-risk tasks. Causal confusion refers to a phenomenon where an agent learns spurious correlations between features which might not hold across the entire state space, preventing safe deployment to real tasks where such correlations might be broken. In this work, we examine whether an agent relies on spurious correlations in critical states, and propose an alternative subset of features on which it should base its decisions instead, to make it less susceptible to causal confusion. Our goal is to increase transparency of DRL agents by exposing the influence of learned spurious correlations on its decisions, and offering advice to developers about feature selection in different parts of state space, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anonymous902109/reccover
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics

MethodsFeature Selection · Balanced Selection