Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He, Xiangyu Zhang, Jian Sun

TL;DR
This paper presents a novel channel pruning technique for deep CNNs that significantly accelerates models like VGG-16, ResNet, and Xception with minimal accuracy loss, enabling faster inference.
Contribution
The authors introduce an iterative two-step channel pruning algorithm based on LASSO regression, improving speed while maintaining accuracy across various architectures.
Findings
Achieved 5x speed-up on VGG-16 with only 0.3% accuracy increase.
Accelerated ResNet and Xception with around 1% accuracy loss at 2x speed-up.
Generalized the pruning method to multi-layer and multi-branch networks.
Abstract
In this paper, we introduce a new channel pruning method to accelerate very deep convolutional neural networks.Given a trained CNN model, we propose an iterative two-step algorithm to effectively prune each layer, by a LASSO regression based channel selection and least square reconstruction. We further generalize this algorithm to multi-layer and multi-branch cases. Our method reduces the accumulated error and enhance the compatibility with various architectures. Our pruned VGG-16 achieves the state-of-the-art results by 5x speed-up along with only 0.3% increase of error. More importantly, our method is able to accelerate modern networks like ResNet, Xception and suffers only 1.4%, 1.0% accuracy loss under 2x speed-up respectively, which is significant. Code has been made publicly available.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsPruning · Batch Normalization · Bottleneck Residual Block · Kaiming Initialization · Residual Block · Average Pooling · Bitcoin Customer Service Number +1-833-534-1729 · Depthwise Convolution · Pointwise Convolution · Global Average Pooling
