Group Pruning using a Bounded-Lp norm for Group Gating and Regularization
Chaithanya Kumar Mummadi, Tim Genewein, Dan Zhang, Thomas Brox, Volker, Fischer

TL;DR
This paper introduces a group-wise pruning method using a bounded-Lp norm for gating and regularization, significantly reducing model size while maintaining accuracy across multiple neural network architectures and datasets.
Contribution
It proposes a novel bounded-Lp regularizer and gating mechanism for effective group pruning during training, improving network efficiency without sacrificing performance.
Findings
ResNet-164 parameters reduced by 30% on CIFAR100
DenseNet-40 parameters reduced by 69% on CIFAR100
MobileNetV2 parameters reduced by 75% on CIFAR100 and further compressed on ImageNet
Abstract
Deep neural networks achieve state-of-the-art results on several tasks while increasing in complexity. It has been shown that neural networks can be pruned during training by imposing sparsity inducing regularizers. In this paper, we investigate two techniques for group-wise pruning during training in order to improve network efficiency. We propose a gating factor after every convolutional layer to induce channel level sparsity, encouraging insignificant channels to become exactly zero. Further, we introduce and analyse a bounded variant of the L1 regularizer, which interpolates between L1 and L0-norms to retain performance of the network at higher pruning rates. To underline effectiveness of the proposed methods,we show that the number of parameters of ResNet-164, DenseNet-40 and MobileNetV2 can be reduced down by 30%, 69% and 75% on CIFAR100 respectively without a significant drop in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
MethodsPruning · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · Inverted Residual Block · Average Pooling · 1x1 Convolution · Convolution · Tether Customer Service Number +1-833-534-1729
