Sampling Permutations for Shapley Value Estimation

Rory Mitchell; Joshua Cooper; Eibe Frank; Geoffrey Holmes

arXiv:2104.12199·stat.ML·February 4, 2022·26 cites

Sampling Permutations for Shapley Value Estimation

Rory Mitchell, Joshua Cooper, Eibe Frank, Geoffrey Holmes

PDF

Open Access

TL;DR

This paper introduces novel approximation methods for estimating Shapley values in machine learning models, leveraging quadrature, kernel herding, and permutation sampling to improve convergence and accuracy.

Contribution

It proposes new quadrature and sampling techniques based on RKHS and hypersphere connections, enhancing Shapley value estimation efficiency and accuracy.

Findings

01

Significant reduction in RMSE for Shapley estimates

02

Improved convergence over standard Monte Carlo methods

03

Effective permutation sampling algorithms developed

Abstract

Game-theoretic attribution techniques based on Shapley values are used to interpret black-box machine learning models, but their exact calculation is generally NP-hard, requiring approximation methods for non-trivial models. As the computation of Shapley values can be expressed as a summation over a set of permutations, a common approach is to sample a subset of these permutations for approximation. Unfortunately, standard Monte Carlo sampling methods can exhibit slow convergence, and more sophisticated quasi-Monte Carlo methods have not yet been applied to the space of permutations. To address this, we investigate new approaches based on two classes of approximation methods and compare them empirically. First, we demonstrate quadrature techniques in a RKHS containing functions of permutations, using the Mallows kernel in combination with kernel herding and sequential Bayesian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Mathematical Approximation and Integration