i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Cameron R. Wolfe; Anastasios Kyrillidis

arXiv:2112.04905·cs.LG·March 31, 2022

i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Cameron R. Wolfe, Anastasios Kyrillidis

PDF

Open Access 1 Repo

TL;DR

i-SpaSP is a new structured pruning algorithm for neural networks inspired by sparse signal recovery, which iteratively identifies important parameter groups to efficiently prune models while maintaining high performance.

Contribution

The paper introduces i-SpaSP, a novel pruning method with theoretical guarantees, demonstrating superior efficiency and performance across various architectures and datasets.

Findings

01

Error decay is polynomial with respect to pruning ratio.

02

i-SpaSP outperforms baseline pruning methods by several orders of magnitude.

03

Effective across multiple architectures and datasets.

Abstract

We propose a novel, structured pruning algorithm for neural networks -- the iterative, Sparse Structured Pruning algorithm, dubbed as i-SpaSP. Inspired by ideas from sparse signal recovery, i-SpaSP operates by iteratively identifying a larger set of important parameter groups (e.g., filters or neurons) within a network that contribute most to the residual between pruned and dense network output, then thresholding these groups based on a smaller, pre-defined pruning ratio. For both two-layer and multi-layer network architectures with ReLU activations, we show the error induced by pruning with i-SpaSP decays polynomially, where the degree of this polynomial becomes arbitrarily large based on the sparsity of the dense network's hidden representations. In our experiments, i-SpaSP is evaluated across a variety of datasets (i.e., MNIST, ImageNet, and XNLI) and architectures (i.e., feed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wolfecameron/i-spasp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Machine Learning and ELM

MethodsPruning · Depthwise Convolution · Batch Normalization · 1x1 Convolution · Pointwise Convolution · Depthwise Separable Convolution · Inverted Residual Block · Average Pooling · Convolution