On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

Emanuele Albini; Shubham Sharma; Saumitra Mishra; Danial Dervovic; Daniele Magazzeni

arXiv:2307.06941·cs.AI·July 14, 2023

TL;DR

This paper establishes a theoretical link between game-theoretic feature attributions, like SHAP, and counterfactual explanations, revealing their conditions for equivalence and limitations, supported by experiments on multiple datasets.

Contribution

The paper provides the first formal theoretical connection between feature attributions and counterfactual explanations, and extends that connection to game-theoretic solution concepts beyond Shapley values.

Findings

01

Under certain conditions, feature attributions and counterfactual explanations are equivalent.

02

Naive use of counterfactuals for feature importance can be misleading.

03

Experimental results validate the theoretical connection across three datasets.

Abstract

Explainable Artificial Intelligence (XAI) has received widespread interest in recent years, and two of the most popular types of explanations are feature attributions and counterfactual explanations. These classes of approaches have been largely studied independently, and the few attempts at reconciling them have been primarily empirical. This work establishes a clear theoretical connection between game-theoretic feature attributions, focusing on but not limited to SHAP, and counterfactual explanations. After motivating operative changes to Shapley-value-based feature attributions and counterfactual explanations, we prove that, under certain conditions, they are in fact equivalent. We then extend the equivalency result to game-theoretic solution concepts beyond Shapley values. Moreover, through the analysis of the conditions of such equivalence, we shed light on the limitations of naively…

Taxonomy

Methods: Counterfactual Explanations · Shapley Additive Explanations