Interpreting Deep Neural Networks with Relative Sectional Propagation by   Analyzing Comparative Gradients and Hostile Activations

Woo-Jeoung Nam; Jaesik Choi; Seong-Whan Lee

arXiv:2012.03434·cs.CV·December 15, 2020

Interpreting Deep Neural Networks with Relative Sectional Propagation by Analyzing Comparative Gradients and Hostile Activations

Woo-Jeoung Nam, Jaesik Choi, Seong-Whan Lee

PDF

Open Access 1 Video

TL;DR

This paper introduces Relative Sectional Propagation (RSP), a novel attribution method for DNNs that improves interpretability by clearly decomposing predictions and addressing shortcomings of existing backpropagation-based attribution techniques.

Contribution

The paper proposes RSP, a new attribution approach that enhances class-discriminative explanations and objectness clarity by handling hostile factors and applying purging techniques during backpropagation.

Findings

01

RSP outperforms existing methods in Pointing Game, mIoU, and Model Sensitivity evaluations.

02

It provides clearer, more intuitive visualizations of DNN decision processes.

03

Experimental results confirm improved attribution accuracy on PASCAL VOC, MS COCO, and ImageNet datasets.

Abstract

The clear transparency of Deep Neural Networks (DNNs) is hampered by complex internal structures and nonlinear transformations along deep hierarchies. In this paper, we propose a new attribution method, Relative Sectional Propagation (RSP), for fully decomposing the output predictions with the characteristics of class-discriminative attributions and clear objectness. We carefully revisit some shortcomings of backpropagation-based attribution methods, which are trade-off relations in decomposing DNNs. We define hostile factor as an element that interferes with finding the attributions of the target and propagate it in a distinguishable way to overcome the non-suppressed nature of activated neurons. As a result, it is possible to assign the bi-polar relevance scores of the target (positive) and hostile (negative) attributions while maintaining each attribution aligned with the importance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Interpreting Deep Neural Networks with Relative Sectional Propagation by Analyzing Comparative Gradients and Hostile Activations· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification