PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator
Shiguang Wang, Tao Xie, Haijun Liu, Xingcheng Zhang, Jian Cheng

TL;DR
PSE-Net introduces an efficient parallel-subnets training method for channel pruning in CNNs, enabling faster supernet training and better subnet evaluation, leading to improved model compression with minimal performance loss.
Contribution
The paper proposes a novel parallel-subnets estimator and training algorithm that significantly accelerates supernet training and enhances subnet evaluation for channel pruning.
Findings
Outperforms state-of-the-art pruning methods on ImageNet
Achieves higher accuracy with fewer FLOPs
Reduces supernet training time by leveraging parallel training
Abstract
Channel Pruning is one of the most widespread techniques used to compress deep neural networks while maintaining their performances. Currently, a typical pruning algorithm leverages neural architecture search to directly find networks with a configurable width, the key step of which is to identify representative subnet for various pruning ratios by training a supernet. However, current methods mainly follow a serial training strategy to optimize supernet, which is very time-consuming. In this work, we introduce PSE-Net, a novel parallel-subnets estimator for efficient channel pruning. Specifically, we propose a parallel-subnets training algorithm that simulate the forward-backward pass of multiple subnets by droping extraneous features on batch dimension, thus various subnets could be trained in one round. Our proposed algorithm facilitates the efficiency of supernet training and equips…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Convolution · Average Pooling · Pruning · Batch Normalization · Inverted Residual Block · 1x1 Convolution
