A matter of attitude: Focusing on positive and active gradients to boost   saliency maps

Oscar Llorente; Jaime Boal; Eugenio F. S\'anchez-\'Ubeda

arXiv:2309.12913·cs.AI·September 25, 2023

A matter of attitude: Focusing on positive and active gradients to boost saliency maps

Oscar Llorente, Jaime Boal, Eugenio F. S\'anchez-\'Ubeda

PDF

Open Access 1 Repo

TL;DR

This paper investigates how preserving the sign of gradients in saliency maps enhances the interpretability of CNNs by better identifying relevant pixels and understanding class influences in multi-class classification.

Contribution

It introduces a method that rescues the sign of gradients in saliency maps, improving the explanation of CNN focus and class influence in image classification.

Findings

01

Considering gradient sign improves pixel relevance identification.

02

Analyzing class influence clarifies CNN decision focus.

03

Enhanced interpretability of CNNs through gradient sign analysis.

Abstract

Saliency maps have become one of the most widely used interpretability techniques for convolutional neural networks (CNN) due to their simplicity and the quality of the insights they provide. However, there are still some doubts about whether these insights are a trustworthy representation of what CNNs use to come up with their predictions. This paper explores how rescuing the sign of the gradients from the saliency map can lead to a deeper understanding of multi-class classification problems. Using both pretrained and trained from scratch CNNs we unveil that considering the sign and the effect not only of the correct class, but also the influence of the other classes, allows to better identify the pixels of the image that the network is really focusing on. Furthermore, how occluding or altering those pixels is expected to affect the outcome also becomes clearer.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oscarllorente/positive_active_saliency_maps
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications