Group Pruning using a Bounded-Lp norm for Group Gating and   Regularization

Chaithanya Kumar Mummadi; Tim Genewein; Dan Zhang; Thomas Brox; Volker; Fischer

arXiv:1908.03463·stat.ML·August 12, 2019·1 cites

Group Pruning using a Bounded-Lp norm for Group Gating and Regularization

Chaithanya Kumar Mummadi, Tim Genewein, Dan Zhang, Thomas Brox, Volker, Fischer

PDF

Open Access

TL;DR

This paper introduces a group-wise pruning method using a bounded-Lp norm for gating and regularization, significantly reducing model size while maintaining accuracy across multiple neural network architectures and datasets.

Contribution

It proposes a novel bounded-Lp regularizer and gating mechanism for effective group pruning during training, improving network efficiency without sacrificing performance.

Findings

01

ResNet-164 parameters reduced by 30% on CIFAR100

02

DenseNet-40 parameters reduced by 69% on CIFAR100

03

MobileNetV2 parameters reduced by 75% on CIFAR100 and further compressed on ImageNet

Abstract

Deep neural networks achieve state-of-the-art results on several tasks while increasing in complexity. It has been shown that neural networks can be pruned during training by imposing sparsity inducing regularizers. In this paper, we investigate two techniques for group-wise pruning during training in order to improve network efficiency. We propose a gating factor after every convolutional layer to induce channel level sparsity, encouraging insignificant channels to become exactly zero. Further, we introduce and analyse a bounded variant of the L1 regularizer, which interpolates between L1 and L0-norms to retain performance of the network at higher pruning rates. To underline effectiveness of the proposed methods,we show that the number of parameters of ResNet-164, DenseNet-40 and MobileNetV2 can be reduced down by 30%, 69% and 75% on CIFAR100 respectively without a significant drop in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsPruning · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Batch Normalization · Inverted Residual Block · Average Pooling · 1x1 Convolution · Convolution · Tether Customer Service Number +1-833-534-1729