Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning

Dan Liu; Xue Liu

arXiv:2212.12651·cs.CV·December 27, 2022·1 cites

Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning

Dan Liu, Xue Liu

PDF

Open Access

TL;DR

This paper introduces a novel retraining-free pruning method for neural networks that uses hyperspherical learning and loss penalties, enabling instant recovery and minimal accuracy loss without fine-tuning.

Contribution

The proposed method allows for effective pruning without retraining and introduces a recovery technique by replacing pruned weights with their mean, outperforming existing approaches.

Findings

01

Achieves 50% pruning on ResNet-18 with less than 0.5% accuracy drop.

02

Significantly improves accuracy of pruned MobileNetV2 models compared to conventional methods.

03

Enables instant accuracy recovery by replacing pruned weights with their mean value.

Abstract

Most existing pruning works are resource-intensive, requiring retraining or fine-tuning of the pruned models for accuracy. We propose a retraining-free pruning method based on hyperspherical learning and loss penalty terms. The proposed loss penalty term pushes some of the model weights far from zero, while the rest weight values are pushed near zero and can be safely pruned with no need for retraining and a negligible accuracy drop. In addition, our proposed method can instantly recover the accuracy of a pruned model by replacing the pruned values with their mean value. Our method obtains state-of-the-art results in retraining-free pruning and is evaluated on ResNet-18/50 and MobileNetV2 with ImageNet dataset. One can easily get a 50\% pruned ResNet18 model with a 0.47\% accuracy drop. With fine-tuning, the experiment results show that our method can significantly boost the accuracy of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsPruning · Pointwise Convolution · Batch Normalization · Depthwise Convolution · Convolution · 1x1 Convolution · Depthwise Separable Convolution · Inverted Residual Block · Average Pooling