InstaSHAP: Interpretable Additive Models Explain Shapley Values   Instantly

James Enouen; Yan Liu

arXiv:2502.14177·cs.LG·February 21, 2025

InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly

James Enouen, Yan Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces InstaSHAP, a method that leverages additive models to efficiently compute Shapley values for model explanations, revealing limitations in GAM models and their impact on SHAP explanations.

Contribution

It establishes a variational link between GAM models and SHAP, enabling fast Shapley value computation and analyzing the representation limits of GAMs in explanation methods.

Findings

01

GAM models can be trained to compute Shapley values in a single forward pass.

02

The limited representation power of GAMs affects the effectiveness of SHAP explanations.

03

Theoretical analysis shows GAMs' constraints mirror those in SHAP, impacting CV and NLP applications.

Abstract

In recent years, the Shapley value and SHAP explanations have emerged as one of the most dominant paradigms for providing post-hoc explanations of black-box models. Despite their well-founded theoretical properties, many recent works have focused on the limitations in both their computational efficiency and their representation power. The underlying connection with additive models, however, is left critically under-emphasized in the current literature. In this work, we find that a variational perspective linking GAM models and SHAP explanations is able to provide deep insights into nearly all recent developments. In light of this connection, we borrow in the other direction to develop a new method to train interpretable GAM models which are automatically purified to compute the Shapley value in a single forward pass. Finally, we provide theoretical results showing the limited…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Semantic Web and Ontologies

MethodsShapley Additive Explanations · Generalized additive models