Approximating Shapley Explanations in Reinforcement Learning

Daniel Beechey; \"Ozg\"ur \c{S}im\c{s}ek

arXiv:2511.06094·cs.LG·November 11, 2025

Approximating Shapley Explanations in Reinforcement Learning

Daniel Beechey, \"Ozg\"ur \c{S}im\c{s}ek

PDF

Open Access

TL;DR

This paper introduces FastSVERL, a scalable method for approximating Shapley value explanations in reinforcement learning, addressing computational challenges and enabling real-time, interpretable decision-making in complex environments.

Contribution

FastSVERL is a novel, scalable approach that efficiently approximates Shapley explanations specifically tailored for reinforcement learning settings.

Findings

01

FastSVERL significantly reduces computation time for Shapley explanations.

02

It effectively handles temporal dependencies and off-policy data.

03

Enables real-time interpretability in reinforcement learning applications.

Abstract

Reinforcement learning has achieved remarkable success in complex decision-making environments, yet its lack of transparency limits its deployment in practice, especially in safety-critical settings. Shapley values from cooperative game theory provide a principled framework for explaining reinforcement learning; however, the computational cost of Shapley explanations is an obstacle to their use. We introduce FastSVERL, a scalable method for explaining reinforcement learning by approximating Shapley values. FastSVERL is designed to handle the unique challenges of reinforcement learning, including temporal dependencies across multi-step trajectories, learning from off-policy data, and adapting to evolving agent behaviours in real time. FastSVERL introduces a practical, scalable approach for principled and rigorous interpretability in reinforcement learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning