Generalization on the Enhancement of Layerwise Relevance   Interpretability of Deep Neural Network

Erico Tjoa; Guan Cuntai

arXiv:2009.02516·cs.CV·October 20, 2020·1 cites

Generalization on the Enhancement of Layerwise Relevance Interpretability of Deep Neural Network

Erico Tjoa, Guan Cuntai

PDF

Open Access 2 Repos

TL;DR

This paper enhances the interpretability of deep neural networks by generalizing layerwise relevance correction techniques, proposing a filtering method tailored to neural network signal trends, and emphasizing the importance of groundtruth interpretability data.

Contribution

It introduces a generalized layerwise error correction method for relevance maps, considering various error types and leveraging groundtruth interpretability information.

Findings

01

Proposed a filtering technique for relevance signal rectification.

02

Studied error propagation forms in relevance methods.

03

Argued for the use of groundtruth interpretability data.

Abstract

The practical application of deep neural networks are still limited by their lack of transparency. One of the efforts to provide explanation for decisions made by artificial intelligence (AI) is the use of saliency or heat maps highlighting relevant regions that contribute significantly to its prediction. A layer-wise amplitude filtering method was previously introduced to improve the quality of heatmaps, performing error corrections by noise-spike suppression. In this study, we generalize the layerwise error correction by considering any identifiable error and assuming there exists a groundtruth interpretable information. The forms of errors propagated through layerwise relevance methods are studied and we propose a filtering technique for interpretability signal rectification taylored to the trend of signal amplitude of the particular neural network used. Finally, we put forth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsInterpretability