NeuroMask: Explaining Predictions of Deep Neural Networks through Mask Learning
Moustafa Alzantot, Amy Widdicombe, Simon Julier, Mani Srivastava

TL;DR
NeuroMask is a novel method that generates interpretable explanations for DNN image classifiers by learning masks that highlight important image regions, improving interpretability without retraining models.
Contribution
NeuroMask introduces a mask-learning approach that provides high-quality, interpretable explanations for DNN predictions without altering the training process.
Findings
NeuroMask accurately localizes relevant image regions.
It produces more interpretable explanations than existing methods.
Demonstrated effectiveness on CIFAR-10 and ImageNet datasets.
Abstract
Deep Neural Networks (DNNs) deliver state-of-the-art performance in many image recognition and understanding applications. However, despite their outstanding performance, these models are black-boxes and it is hard to understand how they make their decisions. Over the past few years, researchers have studied the problem of providing explanations of why DNNs predicted their results. However, existing techniques are either obtrusive, requiring changes in model training, or suffer from low output quality. In this paper, we present a novel method, NeuroMask, for generating an interpretable explanation of classification model results. When applied to image classification models, NeuroMask identifies the image parts that are most important to classifier results by applying a mask that hides/reveals different parts of the image, before feeding it back into the model. The mask values are tuned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
