Stabilizing Estimates of Shapley Values with Control Variates

Jeremy Goldwasser; Giles Hooker

arXiv:2310.07672·stat.ML·April 11, 2024·1 cites

Stabilizing Estimates of Shapley Values with Control Variates

Jeremy Goldwasser, Giles Hooker

PDF

Open Access 1 Repo

TL;DR

This paper introduces ControlSHAP, a Monte Carlo-based method that significantly reduces the variability in Shapley value estimates for model explanations without extra computational cost.

Contribution

It presents a novel control variates approach for stabilizing Shapley value estimates applicable to any machine learning model.

Findings

01

Reduces Monte Carlo variability in Shapley estimates

02

Applicable to high-dimensional datasets

03

Requires minimal additional computation

Abstract

Shapley values are among the most popular tools for explaining predictions of blackbox machine learning models. However, their high computational cost motivates the use of sampling approximations, inducing a considerable degree of uncertainty. To stabilize these model explanations, we propose ControlSHAP, an approach based on the Monte Carlo technique of control variates. Our methodology is applicable to any machine learning model and requires virtually no extra computation or modeling effort. On several high-dimensional datasets, we find it can produce dramatic reductions in the Monte Carlo variability of Shapley estimates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeremy-goldwasser/controlshap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Statistical Methods and Inference · Machine Learning and Data Classification