Towards trustable SHAP scores

Olivier Letoffe; Xuanxiang Huang; Joao Marques-Silva

arXiv:2405.00076·cs.LG·December 20, 2024·2 cites

Towards trustable SHAP scores

Olivier Letoffe, Xuanxiang Huang, Joao Marques-Silva

PDF

Open Access

TL;DR

This paper proposes a modified definition of SHAP scores that extends axiomatic aggregations to Shapley values, addressing known limitations and ensuring more trustworthy feature influence explanations in XAI.

Contribution

It introduces a new SHAP score formulation that avoids previous issues, characterizes its computational complexity, and suggests practical implementation modifications.

Findings

01

New SHAP scores avoid known unsatisfactory cases

02

Computational complexity analysis identifies tractable classifier families

03

Implementation modifications improve reliability with minimal performance impact

Abstract

SHAP scores represent the proposed use of the well-known Shapley values in eXplainable Artificial Intelligence (XAI). Recent work has shown that the exact computation of SHAP scores can produce unsatisfactory results. Concretely, for some ML models, SHAP scores will mislead with respect to relative feature influence. To address these limitations, recently proposed alternatives exploit different axiomatic aggregations, all of which are defined in terms of abductive explanations. However, the proposed axiomatic aggregations are not Shapley values. This paper investigates how SHAP scores can be modified so as to extend axiomatic aggregations to the case of Shapley values in XAI. More importantly, the proposed new definition of SHAP scores avoids all the known cases where unsatisfactory results have been identified. The paper also characterizes the complexity of computing the novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques

MethodsShapley Additive Explanations · Characteristic Function Estimation for Discrete Probability Distributions