Neural Network Compression by Joint Sparsity Promotion and Redundancy Reduction
Tariq M. Khan, Syed S. Naqvi, Antonio Robles-Kelly, and Erik Meijering

TL;DR
This paper introduces a novel neural network compression method combining filter pruning and a gradient counting-based sparsity scheme, significantly reducing model size and memory without performance loss, and outperforming existing methods on segmentation benchmarks.
Contribution
A new training scheme that jointly promotes sparsity and prunes redundant filters using gradient counting, improving compression and performance.
Findings
Significant reduction in network parameters and memory footprint.
Improved performance over MobileNetV3 on segmentation tasks.
Effective compression without accuracy degradation.
Abstract
Compression of convolutional neural network models has recently been dominated by pruning approaches. A class of previous works focuses solely on pruning the unimportant filters to achieve network compression. Another important direction is the design of sparsity-inducing constraints which has also been explored in isolation. This paper presents a novel training scheme based on composite constraints that prune redundant filters and minimize their effect on overall network learning via sparsity promotion. Also, as opposed to prior works that employ pseudo-norm-based sparsity-inducing constraints, we propose a sparse scheme based on gradient counting in our framework. Our tests on several pixel-wise segmentation benchmarks show that the number of neurons and the memory footprint of networks in the test phase are significantly reduced without affecting performance. MobileNetV3 and UNet,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
MethodsPruning · Test · *Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · ReLU6 · Convolution · 1x1 Convolution · Batch Normalization
