Receding Neuron Importances for Structured Pruning

Mihai Suteu; Yike Guo

arXiv:2204.06404·cs.LG·April 14, 2022·1 cites

Receding Neuron Importances for Structured Pruning

Mihai Suteu, Yike Guo

PDF

Open Access

TL;DR

This paper introduces a novel structured pruning method that uses a bounded BatchNorm variation and a regularisation term to effectively identify and remove unimportant neurons, enabling larger and less damaging network compression.

Contribution

The paper proposes a new regularisation technique with bounded BatchNorm parameters that selectively suppresses low-importance neurons, improving pruning efficiency and performance.

Findings

01

Enables larger pruning ratios with less accuracy loss.

02

Outperforms existing pruning methods on VGG and ResNet architectures.

03

Achieves effective one-shot pruning on CIFAR and ImageNet datasets.

Abstract

Structured pruning efficiently compresses networks by identifying and removing unimportant neurons. While this can be elegantly achieved by applying sparsity-inducing regularisation on BatchNorm parameters, an L1 penalty would shrink all scaling factors rather than just those of superfluous neurons. To tackle this issue, we introduce a simple BatchNorm variation with bounded scaling parameters, based on which we design a novel regularisation term that suppresses only neurons with low importance. Under our method, the weights of unnecessary neurons effectively recede, producing a polarised bimodal distribution of importances. We show that neural networks trained this way can be pruned to a larger extent and with less deterioration. We one-shot prune VGG and ResNet architectures at different ratios on CIFAR and ImagenNet datasets. In the case of VGG-style networks, our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis

MethodsPruning · Softmax · 1x1 Convolution · Dropout · Dense Connections · Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · Kaiming Initialization