TL;DR
This paper introduces TSGB, a novel gradient backpropagation method that produces more accurate, target-specific, and fine-grained saliency maps for CNNs, improving interpretability in visual tasks.
Contribution
The paper proposes TSGB, a new gradient rectification approach for better target-selective and fine-grained saliency maps in CNN explanations, addressing limitations of existing methods.
Findings
TSGB outperforms existing saliency methods on ImageNet and Pascal VOC.
It produces more accurate and reliable target-specific saliency maps.
The method is efficient and suitable for downstream visual tasks.
Abstract
The explanation for deep neural networks has drawn extensive attention in the deep learning community over the past few years. In this work, we study the visual saliency, a.k.a. visual explanation, to interpret convolutional neural networks. Compared to iteration based saliency methods, single backward pass based saliency methods benefit from faster speed, and they are widely used in downstream visual tasks. Thus, we focus on single backward pass based methods. However, existing methods in this category struggle to uccessfully produce fine-grained saliency maps concentrating on specific target classes. That said, producing faithful saliency maps satisfying both target-selectiveness and fine-grainedness using a single backward pass is a challenging problem in the field. To mitigate this problem, we revisit the gradient flow inside the network, and find that the entangled semantics and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
