Causal Analysis of Shapley Values: Conditional vs. Marginal
Ilya Rozenfeld

TL;DR
This paper analyzes the differences between conditional and marginal Shapley values in ML explanations, arguing that the marginal approach is more causally sound and resolving a controversy in the literature.
Contribution
It provides a causal perspective to compare Shapley value calculation methods, advocating for the marginal approach over the conditional one.
Findings
Conditional approach is causally unsound
Marginal approach is preferable for causal explanations
Clarifies conflicting recommendations in existing literature
Abstract
Shapley values, a game theoretic concept, has been one of the most popular tools for explaining Machine Learning (ML) models in recent years. Unfortunately, the two most common approaches, conditional and marginal, to calculating Shapley values can lead to different results along with some undesirable side effects when features are correlated. This in turn has led to the situation in the literature where contradictory recommendations regarding choice of an approach are provided by different authors. In this paper we aim to resolve this controversy through the use of causal arguments. We show that the differences arise from the implicit assumptions that are made within each method to deal with missing causal information. We also demonstrate that the conditional approach is fundamentally unsound from a causal perspective. This, together with previous work in [1], leads to the conclusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems
