Pruning with Compensation: Efficient Channel Pruning for Deep Convolutional Neural Networks
Zhouyang Xie, Yan Fu, Shengzhao Tian, Junlin Zhou, Duanbing Chen

TL;DR
This paper introduces a highly efficient channel pruning method for deep CNNs that significantly reduces computational cost and data requirements by using a novel compensation technique and a binary search algorithm, eliminating the need for extensive re-training.
Contribution
The paper proposes a new pruning approach combining compensation and a binary structural search to improve efficiency and reduce human intervention in CNN pruning.
Findings
Reduces pruning time by 95% on benchmarks
Cuts data usage by 90% compared to traditional methods
Achieves competitive accuracy with state-of-the-art pruning techniques
Abstract
Channel pruning is a promising technique to compress the parameters of deep convolutional neural networks(DCNN) and to speed up the inference. This paper aims to address the long-standing inefficiency of channel pruning. Most channel pruning methods recover the prediction accuracy by re-training the pruned model from the remaining parameters or random initialization. This re-training process is heavily dependent on the sufficiency of computational resources, training data, and human interference(tuning the training strategy). In this paper, a highly efficient pruning method is proposed to significantly reduce the cost of pruning DCNN. The main contributions of our method include: 1) pruning compensation, a fast and data-efficient substitute of re-training to minimize the post-pruning reconstruction loss of features, 2) compensation-aware pruning(CaP), a novel pruning algorithm to remove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsPruning · Diffusion-Convolutional Neural Networks
