Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks
Zhaofeng Si, Honggang Qi, Xiaoyu Song

TL;DR
This paper introduces a novel block-wise pruning method for deep CNNs using auxiliary gating structures and a voting strategy, achieving high compression rates while maintaining performance.
Contribution
It proposes a new structured pruning approach with auxiliary gating and a voting strategy, improving compression and compatibility with other pruning methods.
Findings
Achieves over 93% FLOPs reduction with maintained accuracy.
State-of-the-art compression performance on classification tasks.
Compatible with other pruning methods for enhanced results.
Abstract
Convolutional neural networks are prevailing in deep learning tasks. However, they suffer from massive cost issues when working on mobile devices. Network pruning is an effective method of model compression to handle such problems. This paper presents a novel structured network pruning method with auxiliary gating structures which assigns importance marks to blocks in backbone network as a criterion when pruning. Block-wise pruning is then realized by proposed voting strategy, which is different from prevailing methods who prune a model in small granularity like channel-wise. We further develop a three-stage training scheduling for the proposed architecture incorporating knowledge distillation for better performance. Our experiments demonstrate that our method can achieve state-of-the-arts compression performance for the classification tasks. In addition, our approach can integrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsPruning · Knowledge Distillation
