Refutation of Shapley Values for XAI -- Additional Evidence
Xuanxiang Huang, Joao Marques-Silva

TL;DR
This paper provides additional evidence that Shapley values are inadequate for explainable AI, extending prior criticisms to non-Boolean and multi-class classifiers, and analyzing feature relevance in adversarial examples.
Contribution
It demonstrates the limitations of Shapley values across broader classifier families and in the context of adversarial examples, challenging their effectiveness in XAI.
Findings
Shapley values are inadequate for non-Boolean classifiers.
Shapley values fail for multi-class classifier explanations.
Features in adversarial examples do not include irrelevant features.
Abstract
Recent work demonstrated the inadequacy of Shapley values for explainable artificial intelligence (XAI). Although to disprove a theory a single counterexample suffices, a possible criticism of earlier work is that the focus was solely on Boolean classifiers. To address such possible criticism, this paper demonstrates the inadequacy of Shapley values for families of classifiers where features are not boolean, but also for families of classifiers for which multiple classes can be picked. Furthermore, the paper shows that the features changed in any minimal distance adversarial examples do not include irrelevant features, thus offering further arguments regarding the inadequacy of Shapley values for XAI.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Benford’s Law and Fraud Detection
MethodsFocus
