On Shapley Credit Allocation for Interpretability
Debraj Basu

TL;DR
This paper extends Shapley value-based interpretability to include causal and indirect effects, proposing new measures and categories for understanding model decisions in a more human-like cause-and-effect manner.
Contribution
It introduces a comprehensive framework combining observational, model-specific, and causal interpretations with novel characteristic functions for feature relevance and uncertainty.
Findings
Incorporates causal effects into Shapley-based explanations
Proposes new measures of uncertainty and dispersion for explanations
Demonstrates relevance of these measures for model performance analysis
Abstract
We emphasize the importance of asking the right question when interpreting the decisions of a learning model. We discuss a natural extension of the theoretical machinery from Janzing et. al. 2020, which answers the question "Why did my model predict a person has cancer?" for answering a more involved question, "What caused my model to predict a person has cancer?" While the former quantifies the direct effects of variables on the model, the latter also accounts for indirect effects, thereby providing meaningful insights wherever human beings can reason in terms of cause and effect. We propose three broad categories for interpretations: observational, model-specific and causal each of which are significant in their own right. Furthermore, this paper quantifies feature relevance by weaving different natures of interpretations together with different measures as characteristic functions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Bayesian Modeling and Causal Inference
