Exact Shapley Values for Local and Model-True Explanations of Decision Tree Ensembles
Thomas W. Campbell, Heinrich Roder, Robert W. Georgantas III, Joanna, Roder

TL;DR
This paper introduces an exact, computationally efficient method for calculating Shapley values to explain individual predictions of decision tree ensembles, improving transparency and accuracy.
Contribution
The paper presents a novel approach to Shapley value-based feature attribution for decision tree ensembles that is both accurate and computationally competitive.
Findings
The new method accurately reflects model prediction details.
It is computationally competitive with existing methods.
The approach outperforms standard methods on synthetic and real data.
Abstract
Additive feature explanations using Shapley values have become popular for providing transparency into the relative importance of each feature to an individual prediction of a machine learning model. While Shapley values provide a unique additive feature attribution in cooperative game theory, the Shapley values that can be generated for even a single machine learning model are far from unique, with theoretical and implementational decisions affecting the resulting attributions. Here, we consider the application of Shapley values for explaining decision tree ensembles and present a novel approach to Shapley value-based feature attribution that can be applied to random forests and boosted decision trees. This new method provides attributions that accurately reflect details of the model prediction algorithm for individual instances, while being computationally competitive with one of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Imbalanced Data Classification Techniques
