Suboptimal Shapley Value Explanations

Xiaolei Lu

arXiv:2502.12209·stat.ML·February 19, 2025

Suboptimal Shapley Value Explanations

Xiaolei Lu

PDF

Open Access

TL;DR

This paper analyzes the limitations of current baseline choices in Shapley value explanations for DNNs, proposing an uncertainty-based reweighting method to improve explanation accuracy and consistency with human understanding.

Contribution

It identifies the problematic baseline causing bias in Shapley explanations and introduces a novel reweighting mechanism to enhance explanation quality and computational efficiency.

Findings

01

The proposed reweighting improves explanation consistency.

02

The method accelerates Shapley value computation.

03

Experiments validate the effectiveness across NLP tasks.

Abstract

Deep Neural Networks (DNNs) have demonstrated strong capacity in supporting a wide variety of applications. Shapley value has emerged as a prominent tool to analyze feature importance to help people understand the inference process of deep neural models. Computing Shapley value function requires choosing a baseline to represent feature's missingness. However, existing random and conditional baselines could negatively influence the explanation. In this paper, by analyzing the suboptimality of different baselines, we identify the problematic baseline where the asymmetric interaction between $x_{i}^{'}$ (the replacement of the faithful influential feature) and other features has significant directional bias toward the model's output, and conclude that $p (y ∣ x_{i}^{'}) = p (y)$ potentially minimizes the asymmetric interaction involving $x_{i}^{'}$ . We further generalize the uninformativeness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic theories and models