Leveraging Local Structure for Improving Model Explanations: An Information Propagation Approach
Ruo Yang, Binghui Wang, Mustafa Bilgic

TL;DR
This paper introduces IProp, a method that refines model explanations by treating per-pixel attribution scores as sources in a dynamic information propagation process, so that each pixel's final score reflects the local structure around it rather than the pixel alone.
Contribution
IProp is a new approach that models attribution scores as a propagation process using Markov Reward Processes, improving existing explanation methods by incorporating local pixel structure.
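To make the propagation idea concrete, here is a minimal sketch of a Markov Reward Process over pixels. It is not the authors' implementation: this summary does not give IProp's transition probabilities or reward definition, so the 4-connected neighbourhood, the Gaussian intensity-similarity kernel, the grayscale input, and the parameters `gamma` and `sigma` are all assumptions made for illustration. The function name `propagate_attributions` is hypothetical. The only standard ingredient is the MRP value function, V = (I - gamma P)^{-1} R, with the base attribution map used as the reward R.

```python
import numpy as np

def propagate_attributions(attr, image, gamma=0.5, sigma=0.1):
    """Illustrative MRP-style propagation of an attribution map.

    attr  : (H, W) base attribution scores, used as the reward R (assumption).
    image : (H, W) grayscale intensities, used to measure pixel similarity (assumption).
    gamma : discount factor in [0, 1).
    sigma : bandwidth of the hypothetical Gaussian similarity kernel.
    Requires H >= 2 and W >= 2 so every pixel has at least one neighbour.
    """
    h, w = attr.shape
    n = h * w
    idx = np.arange(n).reshape(h, w)
    P = np.zeros((n, n))

    # Connect each pixel to its 4 neighbours, weighted by intensity similarity.
    for dy, dx in ((0, 1), (1, 0), (0, -1), (-1, 0)):
        src_y = slice(max(0, -dy), h - max(0, dy))
        src_x = slice(max(0, -dx), w - max(0, dx))
        dst_y = slice(max(0, dy), h - max(0, -dy))
        dst_x = slice(max(0, dx), w - max(0, -dx))
        diff = image[src_y, src_x] - image[dst_y, dst_x]
        weight = np.exp(-(diff ** 2) / (2 * sigma ** 2))
        P[idx[src_y, src_x].ravel(), idx[dst_y, dst_x].ravel()] = weight.ravel()

    # Normalise rows so P is a valid (row-stochastic) transition matrix.
    P /= P.sum(axis=1, keepdims=True)

    # MRP value function: V = R + gamma * P @ V  =>  (I - gamma * P) V = R.
    V = np.linalg.solve(np.eye(n) - gamma * P, attr.ravel())
    return V.reshape(h, w)
```

Because `P` is row-stochastic, a constant attribution map of ones comes back as a constant map with value 1 / (1 - gamma), which is a quick sanity check on the solver. The dense `n x n` matrix keeps the sketch short but only suits small images; a sparse solver would be the natural choice at realistic resolutions.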
Findings
IProp significantly improves interpretability metrics across various explanation methods.
The method is compatible with multiple attribution techniques and DNN models.
Experimental results demonstrate enhanced explanation quality.
Abstract
Numerous explanation methods have been recently developed to interpret the decisions made by deep neural network (DNN) models. For image classifiers, these methods typically provide an attribution score to each pixel in the image to quantify its contribution to the prediction. However, most of these explanation methods assign attribution scores to pixels independently, even though both humans and DNNs make decisions by analyzing a set of closely related pixels simultaneously. Hence, the attribution score of a pixel should be evaluated jointly by considering itself and its structurally-similar pixels. We propose a method called IProp, which models each pixel's individual attribution score as a source of explanatory information and explains the image prediction through the dynamic propagation of information across all pixels. To formulate the information propagation, IProp adopts the…
Taxonomy
Methods: Sparse Evolutionary Training
