SHAP@k:Efficient and Probably Approximately Correct (PAC) Identification of Top-k Features
Sanjay Kariyappa, Leonidas Tsepenekas, Freddy L\'ecu\'e, Daniele, Magazzeni

TL;DR
This paper introduces KernelSHAP@k and SamplingSHAP@k, novel methods that significantly improve the sample efficiency of identifying top-k features with the highest SHAP values, leveraging multi-armed bandit techniques for better stopping rules and sampling strategies.
Contribution
It frames the top-k SHAP feature identification as an Explore-m problem, applying MAB techniques to enhance sample efficiency and provide PAC guarantees.
Findings
Achieves an average 5x reduction in sample usage and runtime
Provides PAC guarantees for top-k feature identification
Improves efficiency over existing SHAP value estimation methods
Abstract
The SHAP framework provides a principled method to explain the predictions of a model by computing feature importance. Motivated by applications in finance, we introduce the Top-k Identification Problem (TkIP), where the objective is to identify the k features with the highest SHAP values. While any method to compute SHAP values with uncertainty estimates (such as KernelSHAP and SamplingSHAP) can be trivially adapted to solve TkIP, doing so is highly sample inefficient. The goal of our work is to improve the sample efficiency of existing methods in the context of solving TkIP. Our key insight is that TkIP can be framed as an Explore-m problem--a well-studied problem related to multi-armed bandits (MAB). This connection enables us to improve sample efficiency by leveraging two techniques from the MAB literature: (1) a better stopping-condition (to stop sampling) that identifies when PAC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
MethodsShapley Additive Explanations
