TL;DR
This paper introduces a neural network pruning method based on explainability scores, effectively reducing model size while maintaining or improving accuracy, especially in low-data transfer learning scenarios.
Contribution
The paper presents a novel explainability-inspired criterion for CNN pruning that outperforms existing methods in resource-constrained transfer learning settings.
Findings
Outperforms state-of-the-art pruning criteria with retraining.
Effectively prunes models in low-data transfer scenarios.
Maintains or improves accuracy after pruning.
Abstract
The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the weights of various layers while at the same time aiming to not sacrifice performance. In this paper, we propose a novel criterion for CNN pruning inspired by neural network interpretability: The most relevant units, i.e. weights or filters, are automatically found using their relevance scores obtained from concepts of explainable AI (XAI). By exploring this idea, we connect the lines of interpretability and model compression research. We show that our proposed method can efficiently prune CNN models in transfer-learning setups in which networks pre-trained on large corpora are adapted to specialized tasks. The method is evaluated on a broad range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning · Interpretability
