TL;DR
This paper introduces Guided Integrated Gradients, an adaptive path method that reduces noise in feature attributions by conditioning the attribution path on the model, resulting in more accurate and aligned saliency maps.
Contribution
The paper proposes Guided IG, a novel adaptive path method that improves attribution quality by minimizing noise accumulation along the IG path, conditioned on the model and input.
Findings
Guided IG produces saliency maps more aligned with model predictions.
Guided IG outperforms existing methods in qualitative and quantitative evaluations.
The adaptive path approach effectively reduces noisy attributions in visual models.
Abstract
Integrated Gradients (IG) is a commonly used feature attribution method for deep neural networks. While IG has many desirable properties, the method often produces spurious/noisy pixel attributions in regions that are not related to the predicted class when applied to visual models. While this has been previously noted, most existing solutions are aimed at addressing the symptoms by explicitly reducing the noise in the resulting attributions. In this work, we show that one of the causes of the problem is the accumulation of noise along the IG path. To minimize the effect of this source of noise, we propose adapting the attribution path itself -- conditioning the path not just on the image but also on the model being explained. We introduce Adaptive Path Methods (APMs) as a generalization of path methods, and Guided IG as a specific instance of an APM. Empirically, Guided IG creates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
