Explanation Multiplicity in SHAP: Characterization and Assessment
Hyunseung Hwang, Seungeun Lee, Lucas Rosenblatt, Steven Euijong Whang, Julia Stoyanovich

TL;DR
This paper investigates the phenomenon of explanation multiplicity in SHAP, revealing that multiple valid explanations for the same decision can vary significantly, challenging the reliability of post-hoc interpretability methods.
Contribution
It introduces a comprehensive methodology to characterize explanation multiplicity, disentangles its sources, and provides baseline comparisons to better interpret explanation stability.
Findings
Explanation multiplicity is widespread across datasets and models.
Common stability metrics can mask significant explanation variability.
Baseline comparisons are essential for meaningful interpretation of explanations.
Abstract
Post-hoc explanations are widely used to justify, contest, and review automated decisions in high-stakes domains such as lending, employment, and healthcare. Among these methods, SHAP is often treated as providing a reliable account of which features mattered for an individual prediction and is routinely used to support recourse, oversight, and accountability. In practice, however, SHAP explanations can differ substantially across repeated runs, even when the individual, prediction task, and trained model are held fixed. We conceptualize and name this phenomenon explanation multiplicity: the existence of multiple, internally valid but substantively different explanations for the same decision. Explanation multiplicity poses a normative challenge for responsible AI deployment, as it undermines expectations that explanations can reliably identify the reasons for an adverse outcome. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
