SHAP@k:Efficient and Probably Approximately Correct (PAC) Identification   of Top-k Features

Sanjay Kariyappa; Leonidas Tsepenekas; Freddy L\'ecu\'e; Daniele; Magazzeni

arXiv:2307.04850·cs.LG·July 12, 2023·1 cites

SHAP@k:Efficient and Probably Approximately Correct (PAC) Identification of Top-k Features

Sanjay Kariyappa, Leonidas Tsepenekas, Freddy L\'ecu\'e, Daniele, Magazzeni

PDF

Open Access

TL;DR

This paper introduces KernelSHAP@k and SamplingSHAP@k, novel methods that significantly improve the sample efficiency of identifying top-k features with the highest SHAP values, leveraging multi-armed bandit techniques for better stopping rules and sampling strategies.

Contribution

It frames the top-k SHAP feature identification as an Explore-m problem, applying MAB techniques to enhance sample efficiency and provide PAC guarantees.

Findings

01

Achieves an average 5x reduction in sample usage and runtime

02

Provides PAC guarantees for top-k feature identification

03

Improves efficiency over existing SHAP value estimation methods

Abstract

The SHAP framework provides a principled method to explain the predictions of a model by computing feature importance. Motivated by applications in finance, we introduce the Top-k Identification Problem (TkIP), where the objective is to identify the k features with the highest SHAP values. While any method to compute SHAP values with uncertainty estimates (such as KernelSHAP and SamplingSHAP) can be trivially adapted to solve TkIP, doing so is highly sample inefficient. The goal of our work is to improve the sample efficiency of existing methods in the context of solving TkIP. Our key insight is that TkIP can be framed as an Explore-m problem--a well-studied problem related to multi-armed bandits (MAB). This connection enables us to improve sample efficiency by leveraging two techniques from the MAB literature: (1) a better stopping-condition (to stop sampling) that identifies when PAC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning

MethodsShapley Additive Explanations