Joint Multi-Dimension Pruning via Numerical Gradient Update
Zechun Liu, Xiangyu Zhang, Zhiqiang Shen, Zhe Li, Yichen, Wei, Kwang-Ting Cheng, Jian Sun

TL;DR
This paper introduces JointPruning, a novel method for simultaneously pruning spatial, depth, and channel dimensions of neural networks using gradient-based optimization, leading to more efficient and effective model compression.
Contribution
The paper proposes a general framework for joint multi-dimension pruning using numerical gradient updates and self-adapted stochastic gradient estimation, enabling end-to-end optimization across multiple pruning dimensions.
Findings
Achieved 2.5% and 2.6% accuracy improvements on MobileNet V1&V2.
Demonstrated superior performance over state-of-the-art methods on ImageNet.
Efficiently pruned networks with high compression ratios.
Abstract
We present joint multi-dimension pruning (abbreviated as JointPruning), an effective method of pruning a network on three crucial aspects: spatial, depth and channel simultaneously. To tackle these three naturally different dimensions, we proposed a general framework by defining pruning as seeking the best pruning vector (i.e., the numerical value of layer-wise channel number, spacial size, depth) and construct a unique mapping from the pruning vector to the pruned network structures. Then we optimize the pruning vector with gradient update and model joint pruning as a numerical gradient optimization process. To overcome the challenge that there is no explicit function between the loss and the pruning vectors, we proposed self-adapted stochastic gradient estimation to construct a gradient path through network loss to pruning vectors and enable efficient gradient update. We show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
MethodsPruning · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block · Batch Normalization · Average Pooling · Max Pooling · Global Average Pooling · Residual Connection · Kaiming Initialization
