SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning

Na Li; Hangguan Shan; Wei Ni; Wenjie Zhang; Xinyu Li

arXiv:2512.05291·cs.LG·February 2, 2026

SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning

Na Li, Hangguan Shan, Wei Ni, Wenjie Zhang, Xinyu Li

PDF

Open Access

TL;DR

This paper introduces RSA2C, an explainable reinforcement learning algorithm that uses SHAP-based attributions within a kernelized actor-critic framework to improve interpretability, stability, and efficiency.

Contribution

It proposes a novel attribution-aware, kernelized actor-critic method utilizing SHAP for state attribution, enhancing interpretability and stability in RL.

Findings

01

RSA2C outperforms baselines in continuous-control tasks.

02

It provides stable convergence bounds under state perturbations.

03

RSA2C offers improved interpretability through state attributions.

Abstract

Actor-critic (AC) methods are a cornerstone of reinforcement learning (RL) but offer limited interpretability. Current explainable RL methods seldom use state attributions to assist training. Rather, they treat all state features equally, thereby neglecting the heterogeneous impacts of individual state dimensions on the reward. We propose RKHS-SHAP-based Advanced Actor-Critic (RSA2C), an attribution-aware, kernelized, two-timescale AC algorithm, including Actor, Value Critic, and Advantage Critic. The Actor is instantiated in a vector-valued reproducing kernel Hilbert space (RKHS) with a Mahalanobis-weighted operator-valued kernel, while the Value Critic and Advantage Critic reside in scalar RKHSs. These RKHS-enhanced components use sparsified dictionaries: the Value Critic maintains its own dictionary, while the Actor and Advantage Critic share one. State attributions, computed from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Model Reduction and Neural Networks