Semifactual Explanations for Reinforcement Learning

Jasmina Gajcin; Jovan Jeromela; Ivana Dusparic

arXiv:2409.05435·cs.AI·September 10, 2024

TL;DR

This paper introduces the first methods for generating semifactual explanations for reinforcement learning agents, enhancing interpretability by providing 'even if' scenarios that clarify decision factors.

Contribution

The paper defines properties of semifactual explanations in RL and proposes two algorithms, SGRL-Rewind and SGRL-Advance, for generating them.

Findings

01. Semifactuals are easier to reach and more diverse.
02. Generated semifactuals better represent the agent's policy.
03. Algorithms outperform baselines in standard RL environments.

Abstract

Reinforcement Learning (RL) is a learning paradigm in which the agent learns from its environment through trial and error. Deep reinforcement learning (DRL) algorithms represent the agent's policies using neural networks, making their decisions difficult to interpret. Explaining the behaviour of DRL agents is necessary to advance user trust, increase engagement, and facilitate integration with real-life tasks. Semifactual explanations aim to explain an outcome by providing "even if" scenarios, such as "even if the car were moving twice as slowly, it would still have to swerve to avoid crashing". Semifactuals help users understand the effects of different factors on the outcome and support the optimisation of resources. While extensively studied in psychology and even utilised in supervised learning, semifactuals have not been used to explain the decisions of RL systems. In this work, we…
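To make the "even if" idea concrete, the sketch below searches for the most different input under which a policy still picks the same action. It is a toy illustration only, not the paper's SGRL-Rewind or SGRL-Advance algorithms: the policy `toy_policy`, its braking model, and the helper `farthest_semifactual` are all hypothetical names and assumptions introduced here.

```python
from typing import Callable, Optional, Sequence


def toy_policy(speed: float, distance: float) -> str:
    """Hypothetical policy: swerve when the car cannot stop within the gap."""
    # Assumed toy braking model (not from the paper): distance grows with speed squared.
    braking_distance = speed ** 2 / 20.0
    return "swerve" if braking_distance > distance else "brake"


def farthest_semifactual(
    policy: Callable[[float, float], str],
    speed: float,
    distance: float,
    candidate_speeds: Sequence[float],
) -> Optional[float]:
    """Return the candidate speed farthest from `speed` that keeps the same action.

    Returns None when no candidate preserves the original action.
    """
    original_action = policy(speed, distance)
    same_outcome = [
        s for s in candidate_speeds
        if s != speed and policy(s, distance) == original_action
    ]
    if not same_outcome:
        return None
    # A semifactual is more informative the further it is from the original state.
    return max(same_outcome, key=lambda s: abs(s - speed))


if __name__ == "__main__":
    semifactual_speed = farthest_semifactual(
        toy_policy, speed=30.0, distance=10.0,
        candidate_speeds=[10.0, 15.0, 20.0, 25.0],
    )
    print(f"Even if the speed were {semifactual_speed}, the car would still swerve.")
```

With these assumed numbers, halving the speed from 30 to 15 still leads to a swerve, which mirrors the car example in the abstract. A real RL setting would also need the semifactual state to be reachable by the agent, which this one-dimensional search does not model.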

Code & Models

Repositories

anonymous902109/sgrl (Official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Software Engineering Research · Explainable Artificial Intelligence (XAI)