Effective Model Compression via Stage-wise Pruning
Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou

TL;DR
This paper introduces a stage-wise pruning method for deep CNNs that improves supernet training fairness and completeness, leading to more effective model compression and better performance on benchmark datasets.
Contribution
The proposed stage-wise pruning (SWP) method addresses unfull and unfair training issues in Auto-ML pruning by splitting supernets and using inplace distillation, achieving state-of-the-art results.
Findings
SWP improves proxy performance accuracy.
SWP outperforms previous Auto-ML pruning methods.
Achieves state-of-the-art on CIFAR-10 and ImageNet.
Abstract
Automated Machine Learning(Auto-ML) pruning methods aim at searching a pruning strategy automatically to reduce the computational complexity of deep Convolutional Neural Networks(deep CNNs). However, some previous work found that the results of many Auto-ML pruning methods cannot even surpass the results of the uniformly pruning method. In this paper, the ineffectiveness of Auto-ML pruning which is caused by unfull and unfair training of the supernet is shown. A deep supernet suffers from unfull training because it contains too many candidates. To overcome the unfull training, a stage-wise pruning(SWP) method is proposed, which splits a deep supernet into several stage-wise supernets to reduce the candidate number and utilize inplace distillation to supervise the stage training. Besides, A wide supernet is hit by unfair training since the sampling probability of each channel is unequal.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Medical Image Segmentation Techniques
MethodsPruning · Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets
