Algorithms to estimate Shapley value feature attributions
Hugh Chen, Ian C. Covert, Scott M. Lundberg, Su-In Lee

TL;DR
This paper systematically analyzes 24 algorithms for estimating Shapley value feature attributions, clarifying their theoretical foundations, computational strategies, and applicability to different model types, while highlighting gaps and future directions.
Contribution
It provides a comprehensive comparison and characterization of existing Shapley value estimation algorithms, distinguishing between model-agnostic and model-specific approaches.
Findings
Benchmarking of various estimation approaches
Classification into model-agnostic and model-specific methods
Identification of research gaps and future directions
Abstract
Feature attributions based on the Shapley value are popular for explaining machine learning models; however, their estimation is complex from both a theoretical and computational standpoint. We disentangle this complexity into two factors: (1)~the approach to removing feature information, and (2)~the tractable estimation strategy. These two factors provide a natural lens through which we can better understand and compare 24 distinct algorithms. Based on the various feature removal approaches, we describe the multiple types of Shapley value feature attributions and methods to calculate each one. Then, based on the tractable estimation strategies, we characterize two distinct families of approaches: model-agnostic and model-specific approximations. For the model-agnostic approximations, we benchmark a wide class of estimation approaches and tie them to alternative yet equivalent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification
