Feature relevance quantification in explainable AI: A causal problem

Dominik Janzing; Lenon Minorics; and Patrick Bl\"obaum

arXiv:1910.13413·stat.ML·November 28, 2019·156 cites

Feature relevance quantification in explainable AI: A causal problem

Dominik Janzing, Lenon Minorics, and Patrick Bl\"obaum

PDF

Open Access

TL;DR

This paper clarifies the correct probabilistic approach for quantifying feature relevance in explainable AI, emphasizing the importance of using unconditional expectations over conditional ones, and critiques current methods like SHAP.

Contribution

It provides a conceptual clarification based on causality theory, distinguishing between observational and interventional probabilities, and critiques existing implementations of SHAP.

Findings

01

Unconditional expectations are the correct basis for feature dropping.

02

Current SHAP implementations approximate conditional expectations, which may be flawed.

03

The distinction impacts the interpretation of feature relevance in explainable AI.

Abstract

We discuss promising recent contributions on quantifying feature relevance using Shapley values, where we observed some confusion on which probability distribution is the right one for dropped features. We argue that the confusion is based on not carefully distinguishing between observational and interventional conditional probabilities and try a clarification based on Pearl's seminal work on causality. We conclude that unconditional rather than conditional expectations provide the right notion of dropping features in contradiction to the theoretical justification of the software package SHAP. Parts of SHAP are unaffected because unconditional expectations (which we argue to be conceptually right) are used as approximation for the conditional ones, which encouraged others to `improve' SHAP in a way that we believe to be flawed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

MethodsShapley Additive Explanations