Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
Chih-Kuan Yeh, Kuan-Yun Lee, Frederick Liu, Pradeep Ravikumar

TL;DR
This paper introduces the Joint Baseline value function for Shapley explanations, which respects data and model properties, is robust to adversarial perturbations, and addresses limitations of existing on- and off-manifold approaches.
Contribution
It formalizes axioms for desirable value functions, proves the uniqueness of the Joint Baseline value function, and demonstrates its effectiveness in experiments.
Findings
Joint Baseline value function satisfies key axioms.
JBshap is robust to adversarial manipulations.
Experimental validation shows improved explanation quality.
Abstract
A popular explainable AI (XAI) approach to quantify feature importance of a given model is via Shapley values. These Shapley values arose in cooperative games, and hence a critical ingredient to compute these in an XAI context is a so-called value function, that computes the "value" of a subset of features, and which connects machine learning models to cooperative games. There are many possible choices for such value functions, which broadly fall into two categories: on-manifold and off-manifold value functions, which take an observational and an interventional viewpoint respectively. Both these classes however have their respective flaws, where on-manifold value functions violate key axiomatic properties and are computationally expensive, while off-manifold value functions pay less heed to the data manifold and evaluate the model on regions for which it wasn't trained. Thus, there is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
