Effective Model Compression via Stage-wise Pruning

Mingyang Zhang; Xinyi Yu; Jingtao Rong; Linlin Ou

arXiv:2011.04908·cs.CV·September 23, 2021

Effective Model Compression via Stage-wise Pruning

Mingyang Zhang, Xinyi Yu, Jingtao Rong, Linlin Ou

PDF

Open Access

TL;DR

This paper introduces a stage-wise pruning method for deep CNNs that improves supernet training fairness and completeness, leading to more effective model compression and better performance on benchmark datasets.

Contribution

The proposed stage-wise pruning (SWP) method addresses unfull and unfair training issues in Auto-ML pruning by splitting supernets and using inplace distillation, achieving state-of-the-art results.

Findings

01

SWP improves proxy performance accuracy.

02

SWP outperforms previous Auto-ML pruning methods.

03

Achieves state-of-the-art on CIFAR-10 and ImageNet.

Abstract

Automated Machine Learning(Auto-ML) pruning methods aim at searching a pruning strategy automatically to reduce the computational complexity of deep Convolutional Neural Networks(deep CNNs). However, some previous work found that the results of many Auto-ML pruning methods cannot even surpass the results of the uniformly pruning method. In this paper, the ineffectiveness of Auto-ML pruning which is caused by unfull and unfair training of the supernet is shown. A deep supernet suffers from unfull training because it contains too many candidates. To overcome the unfull training, a stage-wise pruning(SWP) method is proposed, which splits a deep supernet into several stage-wise supernets to reduce the candidate number and utilize inplace distillation to supervise the stage training. Besides, A wide supernet is hit by unfair training since the sampling probability of each channel is unequal.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Medical Image Segmentation Techniques

MethodsPruning · Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets