On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations
Emanuele Albini, Shubham Sharma, Saumitra Mishra, Danial Dervovic, Daniele Magazzeni

TL;DR
This paper establishes a theoretical link between game-theoretic feature attributions, like SHAP, and counterfactual explanations, revealing their conditions for equivalence and limitations, supported by experiments on multiple datasets.
Contribution
It provides the first formal theoretical connection between feature attributions and counterfactual explanations, extending to various game-theoretic solution concepts.
Findings
Under certain conditions, feature attributions and counterfactual explanations are equivalent.
Naive use of counterfactuals for feature importance can be misleading.
Experimental results validate the theoretical connection across three datasets.
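To make the findings above more concrete, here is a minimal sketch of Shapley values computed with a counterfactual point as the baseline. It is an illustration only, not the paper's method or its equivalence conditions: the toy `model`, the input `x`, and the `counterfactual` point are all hypothetical, and `shapley_values` is a plain exact enumeration that only scales to a handful of features.

```python
from itertools import combinations
from math import factorial


def model(x):
    # Hypothetical toy score: a linear term plus an interaction term.
    return 2.0 * x[0] + 1.0 * x[1] * x[2]


def shapley_values(f, x, baseline):
    """Exact Shapley values of f at x; absent features take baseline values."""
    n = len(x)

    def value(coalition):
        # Features in the coalition keep their value from x, the rest use the baseline.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return f(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


x = [1.0, 1.0, 1.0]
# Hypothetical counterfactual: differs from x in features 0 and 2 only.
counterfactual = [0.0, 1.0, 0.0]
phi = shapley_values(model, x, counterfactual)
```

In this toy setup the feature that the counterfactual leaves unchanged receives zero attribution, and the attributions sum to the difference in model output between `x` and the counterfactual. A naive counterfactual-based importance that only flags which features changed would not show how that difference is split between them.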
Abstract
Explainable Artificial Intelligence (XAI) has received widespread interest in recent years, and two of the most popular types of explanations are feature attributions and counterfactual explanations. These classes of approaches have been largely studied independently, and the few attempts at reconciling them have been primarily empirical. This work establishes a clear theoretical connection between game-theoretic feature attributions, focusing on but not limited to SHAP, and counterfactual explanations. After motivating operative changes to Shapley-value-based feature attributions and counterfactual explanations, we prove that, under certain conditions, they are in fact equivalent. We then extend the equivalence result to game-theoretic solution concepts beyond Shapley values. Moreover, through the analysis of the conditions of such equivalence, we shed light on the limitations of naively…
Taxonomy
Methods: Counterfactual Explanations · Shapley Additive Explanations
