SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Arshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar, Dan Alistarh

TL;DR
SPADE introduces a novel sample-specific sparsity-based preprocessing method that enhances neural network interpretability without altering the trained model or its inference behavior.
Contribution
This work is the first to incorporate sparsity directly into the interpretation process as a preprocessing step, improving interpretability methods without retraining or modifying the original network.
Findings
SPADE improves the accuracy of image saliency maps across multiple interpretability methods.
SPADE enhances the usefulness of neuron visualizations for understanding network behavior.
Preprocessing with SPADE does not affect the model's inference performance.
Abstract
It is known that sparsity can improve interpretability for deep neural networks. However, existing methods in the area either require networks that are pre-trained with sparsity constraints, or impose sparsity after the fact, altering the network's general behavior. In this paper, we demonstrate, for the first time, that sparsity can instead be incorporated into the interpretation process itself, as a sample-specific preprocessing step. Unlike previous work, this approach, which we call SPADE, does not place constraints on the trained model and does not affect its behavior during inference on the sample. Given a trained model and a target sample, SPADE uses sample-targeted pruning to provide a "trace" of the network's execution on the sample, reducing the network to the most important connections prior to computing an interpretation. We demonstrate that preprocessing with SPADE…
Peer Reviews
Decision·ICML 2024 Poster
1. The authors' proposed method, SPADE, is straightforward to understand and makes use of recent innovations in related bodies of work (e.g., sparsity solvers) in a clever way. The authors introduce SPADE as a general and customizable approach, and clearly outline different design decisions that one could make to change SPADE in practice (like use of different solvers, objectives, or explainers once the sparse network has been learned). They also thoroughly ablate their approach which I apprec
I am happy to consider adjusting my score if my concerns are addressed. * **Weakness #1: Transparency about limitations of the proposed approach**. I believe that the present draft would be made much stronger if it dedicated more time to thoughtfully discussing limitations and implications of the author's proposed approach. I list what I believe are significant limitations below. I do not think that these limitations weaken the proposed method (all explanation approaches have their own limit
1. **Reproducibility** — The authors describe experimentation settings (including human experiments, datasets, metrics, sparsity ratios, etc) in detail, open-source their code and provide model weights (with Trojan backdoors) for reproducibility. 2. **Organisation and writing** — This paper is very well-written and meticulously organised. The Appendix includes a table of contents and is highly readable. 3. **Evaluation** — SPADE evaluates on a variety of path-based and perturbation-based
1. **Small-scale ImageNet experiments?** — Please clarify if I misunderstood but it appears that the main result (i.e., saliency map "accuracy" of SPADE vs. Dense vs. Sparse FC on ResNet50/ImageNet) is only calculated for **140 test samples out of the available 50,000** in the ImageNet-1K validation set. It seems bold to claim AUC improvements and Pointing Game score gains when evaluation is done on 0.0028 of the actual validation set, especially when it is unclear how/why these 140 chosen sampl
- The idea of pruning the model for a specific input for better explanation is clear. - Extensive experimental results demonstrate the effectiveness of the proposed method.
- The computational effort required for preprocessing by SPADE is relatively high, making it difficult to use in practical situations. - I could not fully understand the details of the human study.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Cell Image Analysis Techniques
MethodsSpatially-Adaptive Normalization · Pruning
