Suboptimal Shapley Value Explanations
Xiaolei Lu

TL;DR
This paper analyzes the limitations of current baseline choices in Shapley value explanations for DNNs, proposing an uncertainty-based reweighting method to improve explanation accuracy and consistency with human understanding.
Contribution
It identifies the problematic baseline causing bias in Shapley explanations and introduces a novel reweighting mechanism to enhance explanation quality and computational efficiency.
Findings
The proposed reweighting improves explanation consistency.
The method accelerates Shapley value computation.
Experiments validate the effectiveness across NLP tasks.
Abstract
Deep Neural Networks (DNNs) have demonstrated strong capacity in supporting a wide variety of applications. Shapley value has emerged as a prominent tool to analyze feature importance to help people understand the inference process of deep neural models. Computing Shapley value function requires choosing a baseline to represent feature's missingness. However, existing random and conditional baselines could negatively influence the explanation. In this paper, by analyzing the suboptimality of different baselines, we identify the problematic baseline where the asymmetric interaction between (the replacement of the faithful influential feature) and other features has significant directional bias toward the model's output, and conclude that potentially minimizes the asymmetric interaction involving . We further generalize the uninformativeness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models
