MoreauGrad: Sparse and Robust Interpretation of Neural Networks via Moreau Envelope
Jingwei Zhang, Farzan Farnia

TL;DR
MoreauGrad introduces a robust, smooth interpretation method for neural networks based on the Moreau envelope, effectively incorporating sparsity and group-sparsity priors, and demonstrating superior interpretability in vision tasks.
Contribution
It proposes MoreauGrad, a novel interpretation scheme leveraging the Moreau envelope for robustness and sparsity, with efficient computation and applicability to various regularization techniques.
Findings
MoreauGrad provides more robust explanations than standard gradient methods.
The scheme effectively incorporates sparsity and group-sparsity priors.
Empirical results show improved interpretability on vision datasets.
Abstract
Explaining the predictions of deep neural nets has been a topic of great interest in the computer vision literature. While several gradient-based interpretation schemes have been proposed to reveal the influential variables in a neural net's prediction, standard gradient-based interpretation frameworks have been commonly observed to lack robustness to input perturbations and flexibility for incorporating prior knowledge of sparsity and group-sparsity structures. In this work, we propose MoreauGrad as an interpretation scheme based on the classifier neural net's Moreau envelope. We demonstrate that MoreauGrad results in a smooth and robust interpretation of a multi-layer neural network and can be efficiently computed through first-order optimization methods. Furthermore, we show that MoreauGrad can be naturally combined with -norm regularization techniques to output a sparse or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
