Sampling Matters in Explanations: Towards Trustworthy Attribution Analysis Building Block in Visual Models through Maximizing Explanation Certainty
R\'ois\'in Luo, James McDermott, Colm O'Riordan

TL;DR
This paper emphasizes the importance of proper sampling in gradient-based attribution methods for visual models, proposing a suppression-based sampling approach to improve explanation certainty and trustworthiness.
Contribution
It introduces a semi-optimal sampling method that aligns sample distribution with natural images, enhancing explanation certainty without adding noise or extra information.
Findings
The proposed sampling method improves explanation quality on ImageNet.
Aligning sample distribution with natural images increases explanation certainty.
The approach outperforms state-of-the-art baselines in attribution analysis.
Abstract
Image attribution analysis seeks to highlight the feature representations learned by visual models such that the highlighted feature maps can reflect the pixel-wise importance of inputs. Gradient integration is a building block in the attribution analysis by integrating the gradients from multiple derived samples to highlight the semantic features relevant to inferences. Such a building block often combines with other information from visual models such as activation or attention maps to form ultimate explanations. Yet, our theoretical analysis demonstrates that the extent to the alignment of the sample distribution in gradient integration with respect to natural image distribution gives a lower bound of explanation certainty. Prior works add noise into images as samples and the noise distributions can lead to low explanation certainty. Counter-intuitively, our experiment shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
