A Learning Paradigm for Interpretable Gradients

Felipe Torres Figueroa; Hanwei Zhang; Ronan Sicre; Yannis Avrithis,; Stephane Ayache

arXiv:2404.15024·cs.CV·April 24, 2024

A Learning Paradigm for Interpretable Gradients

Felipe Torres Figueroa, Hanwei Zhang, Ronan Sicre, Yannis Avrithis,, Stephane Ayache

PDF

Open Access

TL;DR

This paper introduces a new training method that regularizes gradients to be less noisy and more interpretable, enhancing the quality of saliency maps for convolutional networks.

Contribution

It proposes a novel regularization loss that aligns standard backpropagation gradients with guided backpropagation, improving interpretability.

Findings

01

Gradients become less noisy and more interpretable.

02

Quantitative improvements in interpretability metrics.

03

Enhanced visualization quality across different networks.

Abstract

This paper studies interpretability of convolutional networks by means of saliency maps. Most approaches based on Class Activation Maps (CAM) combine information from fully connected layers and gradient through variants of backpropagation. However, it is well understood that gradients are noisy and alternatives like guided backpropagation have been proposed to obtain better visualization at inference. In this work, we present a novel training approach to improve the quality of gradients for interpretability. In particular, we introduce a regularization loss such that the gradient with respect to the input image obtained by standard backpropagation is similar to the gradient obtained by guided backpropagation. We find that the resulting gradient is qualitatively less noisy and improves quantitatively the interpretability properties of different networks, using several interpretability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning