When saliency goes off on a tangent: Interpreting Deep Neural Networks with nonlinear saliency maps
Jan Rosenzweig, Zoran Cvetkovic, Ivana Rosenzweig

TL;DR
This paper introduces nonlinear saliency maps that better interpret deep neural networks by capturing their nonlinearity, improving understanding of complex decision processes over traditional linear methods.
Contribution
The paper proposes a novel class of nonlinear saliency map methods that fully account for neural network nonlinearity, enhancing interpretability in complex models.
Findings
Nonlinear saliency maps outperform linear ones on complex problems
They identify more specific decision drivers in neural networks
They improve interpretability of deep learning models
Abstract
A fundamental bottleneck in utilising complex machine learning systems for critical applications has been not knowing why they do and what they do, thus preventing the development of any crucial safety protocols. To date, no method exist that can provide full insight into the granularity of the neural network's decision process. In the past, saliency maps were an early attempt at resolving this problem through sensitivity calculations, whereby dimensions of a data point are selected based on how sensitive the output of the system is to them. However, the success of saliency maps has been at best limited, mainly due to the fact that they interpret the underlying learning system through a linear approximation. We present a novel class of methods for generating nonlinear saliency maps which fully account for the nonlinearity of the underlying learning system. While agreeing with linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques
