Accurate Shapley Values for explaining tree-based models

Salim I. Amoukou; Nicolas J-B. Brunel; Tangi Sala\"un

arXiv:2106.03820·stat.ML·June 1, 2023·6 cites

Accurate Shapley Values for explaining tree-based models

Salim I. Amoukou, Nicolas J-B. Brunel, Tangi Sala\"un

PDF

Open Access 1 Repo

TL;DR

This paper introduces two new, more accurate estimators for computing Shapley Values in tree-based models, addressing issues of encoding sensitivity and efficiency, with practical benefits demonstrated through simulations.

Contribution

The paper presents novel estimators for Shapley Values that leverage tree structures for improved accuracy and efficiency in model explanations.

Findings

01

New estimators outperform existing methods in accuracy

02

Simulations show practical gains over state-of-the-art algorithms

03

Methods are implemented in a Python package for accessibility

Abstract

Shapley Values (SV) are widely used in explainable AI, but their estimation and interpretation can be challenging, leading to inaccurate inferences and explanations. As a starting point, we remind an invariance principle for SV and derive the correct approach for computing the SV of categorical variables that are particularly sensitive to the encoding used. In the case of tree-based models, we introduce two estimators of Shapley Values that exploit the tree structure efficiently and are more accurate than state-of-the-art methods. Simulations and comparisons are performed with state-of-the-art algorithms and show the practical gain of our approach. Finally, we discuss the limitations of Shapley Values as a local explanation. These methods are available as a Python package.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salimamoukou/acv00
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification