Model Compression using Progressive Channel Pruning
Jinyang Guo, Weichen Zhang, Wanli Ouyang, Dong Xu

TL;DR
This paper introduces Progressive Channel Pruning (PCP), a novel iterative framework for CNN channel pruning that improves efficiency and accuracy retention, applicable to both supervised and transfer learning scenarios.
Contribution
The paper presents a new progressive pruning framework that iteratively prunes channels based on estimated accuracy drops, outperforming existing methods in supervised and transfer learning.
Findings
Outperforms existing channel pruning methods in benchmarks.
Effectively reduces data distribution mismatch in transfer learning.
Maintains higher accuracy with fewer channels pruned.
Abstract
In this work, we propose a simple but effective channel pruning framework called Progressive Channel Pruning (PCP) to accelerate Convolutional Neural Networks (CNNs). In contrast to the existing channel pruning methods that prune channels only once per layer in a layer-by-layer fashion, our new progressive framework iteratively prunes a small number of channels from several selected layers, which consists of a three-step attempting-selecting-pruning pipeline in each iteration. In the attempting step, we attempt to prune a pre-defined number of channels from one layer by using any existing channel pruning methods and estimate the accuracy drop for this layer based on the labelled samples in the validation set. In the selecting step, based on the estimated accuracy drops for all layers, we propose a greedy strategy to automatically select a set of layers that will lead to less overall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
