Explaining Convolutional Neural Networks using Softmax Gradient Layer-wise Relevance Propagation
Brian Kenji Iwana, Ryohei Kuroki, Seiichi Uchida

TL;DR
This paper introduces SGLRP, a novel method for visualizing and understanding CNN predictions by attributing pixel importance using softmax gradients, improving interpretability and class discrimination.
Contribution
It proposes SGLRP, an extension of Deep Taylor Decomposition that leverages softmax gradients for better input attribution and class discrimination in CNNs.
Findings
SGLRP effectively localizes relevant image regions for target classes.
SGLRP outperforms existing LRP-based methods in class discrimination.
The method enhances understanding of CNN decision processes.
Abstract
Convolutional Neural Networks (CNN) have become state-of-the-art in the field of image classification. However, not everything is understood about their inner representations. This paper tackles the interpretability and explainability of the predictions of CNNs for multi-class classification problems. Specifically, we propose a novel visualization method of pixel-wise input attribution called Softmax-Gradient Layer-wise Relevance Propagation (SGLRP). The proposed model is a class discriminate extension to Deep Taylor Decomposition (DTD) using the gradient of softmax to back propagate the relevance of the output probability to the input image. Through qualitative and quantitative analysis, we demonstrate that SGLRP can successfully localize and attribute the regions on input images which contribute to a target object's classification. We show that the proposed method excels at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsInterpretability · Softmax
