Compact Deep Convolutional Neural Networks With Coarse Pruning
Sajid Anwar, Wonyong Sung

TL;DR
This paper introduces a coarse pruning method at feature map and kernel levels to create compact deep convolutional neural networks, significantly reducing computational costs while maintaining accuracy.
Contribution
It presents a novel, simple strategy for coarse-grained pruning that is hardware-friendly and effectively reduces network complexity without substantial accuracy loss.
Findings
Achieved over 85% sparsity in convolution layers
Less than 1% increase in misclassification rate
Effective pruning strategy for real-time inference
Abstract
The learning capability of a neural network improves with increasing depth at higher computational costs. Wider layers with dense kernel connectivity patterns furhter increase this cost and may hinder real-time inference. We propose feature map and kernel level pruning for reducing the computational complexity of a deep convolutional neural network. Pruning feature maps reduces the width of a layer and hence does not need any sparse representation. Further, kernel pruning converts the dense connectivity pattern into a sparse one. Due to coarse nature, these pruning granularities can be exploited by GPUs and VLSI based implementations. We propose a simple and generic strategy to choose the least adversarial pruning masks for both granularities. The pruned networks are retrained which compensates the loss in accuracy. We obtain the best pruning ratios when we prune a network with both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsPruning · Convolution
