Sparse Explanations of Neural Networks Using Pruned Layer-Wise Relevance Propagation
Paulo Yanez Sarmiento, Simon Witzke, Nadja Klein, Bernhard Y. Renard

TL;DR
This paper introduces a modified layer-wise relevance propagation method that enforces sparsity through relevance pruning, improving the clarity and focus of neural network explanations for complex data like images and genomes.
Contribution
It proposes a relevance pruning approach that enhances explanation sparsity without altering the neural network architecture, applicable to diverse data types.
Findings
Reduces noise in relevance attributions
Concentrates relevance on key features
Effective for both image and genome data
Abstract
Explainability is a key component in many applications involving deep neural networks (DNNs). However, current explanation methods for DNNs commonly leave it to the human observer to distinguish relevant explanations from spurious noise. This is not feasible anymore when going from easily human-accessible data such as images to more complex data such as genome sequences. To facilitate the accessibility of DNN outputs from such complex data and to increase explainability, we present a modification of the widely used explanation method layer-wise relevance propagation. Our approach enforces sparsity directly by pruning the relevance propagation for the different layers. Thereby, we achieve sparser relevance attributions for the input features as well as for the intermediate layers. As the relevance propagation is input-specific, we aim to prune the relevance propagation rather than the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications · Seismology and Earthquake Studies
MethodsPruning
