i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery
Cameron R. Wolfe, Anastasios Kyrillidis

TL;DR
i-SpaSP is a new structured pruning algorithm for neural networks inspired by sparse signal recovery, which iteratively identifies important parameter groups to efficiently prune models while maintaining high performance.
Contribution
The paper introduces i-SpaSP, a novel pruning method with theoretical guarantees, demonstrating superior efficiency and performance across various architectures and datasets.
Findings
Error decay is polynomial with respect to pruning ratio.
i-SpaSP outperforms baseline pruning methods by several orders of magnitude.
Effective across multiple architectures and datasets.
Abstract
We propose a novel, structured pruning algorithm for neural networks -- the iterative, Sparse Structured Pruning algorithm, dubbed as i-SpaSP. Inspired by ideas from sparse signal recovery, i-SpaSP operates by iteratively identifying a larger set of important parameter groups (e.g., filters or neurons) within a network that contribute most to the residual between pruned and dense network output, then thresholding these groups based on a smaller, pre-defined pruning ratio. For both two-layer and multi-layer network architectures with ReLU activations, we show the error induced by pruning with i-SpaSP decays polynomially, where the degree of this polynomial becomes arbitrarily large based on the sparsity of the dense network's hidden representations. In our experiments, i-SpaSP is evaluated across a variety of datasets (i.e., MNIST, ImageNet, and XNLI) and architectures (i.e., feed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Machine Learning and ELM
MethodsPruning · Depthwise Convolution · Batch Normalization · 1x1 Convolution · Pointwise Convolution · Depthwise Separable Convolution · Inverted Residual Block · Average Pooling · Convolution
