A generalizable saliency map-based interpretation of model outcome

Shailja Thakur; Sebastian Fischmeister

arXiv:2006.09504·cs.CV·June 22, 2020

A generalizable saliency map-based interpretation of model outcome

Shailja Thakur, Sebastian Fischmeister

PDF

TL;DR

This paper introduces a novel, non-intrusive saliency map-based interpretability method for deep neural networks that localizes relevant input pixels and reconstructs salient inputs, improving understanding of model decisions especially in safety-critical applications.

Contribution

It presents a new approach that optimizes input masks for better localization of relevant features and reconstructs salient inputs to provide global explanations, outperforming existing saliency methods.

Findings

01

Outperforms existing saliency methods in localizing relevant pixels.

02

Achieves 89% classification accuracy in reconstructing salient inputs.

03

Provides both local and global interpretability of neural network decisions.

Abstract

One of the significant challenges of deep neural networks is that the complex nature of the network prevents human comprehension of the outcome of the network. Consequently, the applicability of complex machine learning models is limited in the safety-critical domains, which incurs risk to life and property. To fully exploit the capabilities of complex neural networks, we propose a non-intrusive interpretability technique that uses the input and output of the model to generate a saliency map. The method works by empirically optimizing a randomly initialized input mask by localizing and weighing individual pixels according to their sensitivity towards the target class. Our experiments show that the proposed model interpretability approach performs better than the existing saliency map-based approaches methods at localizing the relevant input pixels. Furthermore, to obtain a global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInterpretability