On Shapley Credit Allocation for Interpretability

Debraj Basu

arXiv:2012.05506·stat.ML·December 11, 2020

On Shapley Credit Allocation for Interpretability

Debraj Basu

PDF

Open Access

TL;DR

This paper extends Shapley value-based interpretability to include causal and indirect effects, proposing new measures and categories for understanding model decisions in a more human-like cause-and-effect manner.

Contribution

It introduces a comprehensive framework combining observational, model-specific, and causal interpretations with novel characteristic functions for feature relevance and uncertainty.

Findings

01

Incorporates causal effects into Shapley-based explanations

02

Proposes new measures of uncertainty and dispersion for explanations

03

Demonstrates relevance of these measures for model performance analysis

Abstract

We emphasize the importance of asking the right question when interpreting the decisions of a learning model. We discuss a natural extension of the theoretical machinery from Janzing et. al. 2020, which answers the question "Why did my model predict a person has cancer?" for answering a more involved question, "What caused my model to predict a person has cancer?" While the former quantifies the direct effects of variables on the model, the latter also accounts for indirect effects, thereby providing meaningful insights wherever human beings can reason in terms of cause and effect. We propose three broad categories for interpretations: observational, model-specific and causal each of which are significant in their own right. Furthermore, this paper quantifies feature relevance by weaving different natures of interpretations together with different measures as characteristic functions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Bayesian Modeling and Causal Inference