A matter of attitude: Focusing on positive and active gradients to boost saliency maps
Oscar Llorente, Jaime Boal, Eugenio F. S\'anchez-\'Ubeda

TL;DR
This paper investigates how preserving the sign of gradients in saliency maps enhances the interpretability of CNNs by better identifying relevant pixels and understanding class influences in multi-class classification.
Contribution
It introduces a method that rescues the sign of gradients in saliency maps, improving the explanation of CNN focus and class influence in image classification.
Findings
Considering gradient sign improves pixel relevance identification.
Analyzing class influence clarifies CNN decision focus.
Enhanced interpretability of CNNs through gradient sign analysis.
Abstract
Saliency maps have become one of the most widely used interpretability techniques for convolutional neural networks (CNN) due to their simplicity and the quality of the insights they provide. However, there are still some doubts about whether these insights are a trustworthy representation of what CNNs use to come up with their predictions. This paper explores how rescuing the sign of the gradients from the saliency map can lead to a deeper understanding of multi-class classification problems. Using both pretrained and trained from scratch CNNs we unveil that considering the sign and the effect not only of the correct class, but also the influence of the other classes, allows to better identify the pixels of the image that the network is really focusing on. Furthermore, how occluding or altering those pixels is expected to affect the outcome also becomes clearer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
