On the Compression of Neural Networks Using $\ell_0$-Norm Regularization and Weight Pruning
Felipe Dennis de Resende Oliveira, Eduardo Luiz Ortiz Batista, Rui, Seara

TL;DR
This paper introduces a novel neural network compression method combining $\,\ell_0$-norm regularization for sparsity, weight pruning, and fine-tuning, effectively reducing model size while maintaining accuracy, suitable for edge and embedded applications.
Contribution
The paper develops a new $\,\ell_0$-norm regularization technique for inducing sparsity and integrates it with pruning and fine-tuning for improved neural network compression.
Findings
The proposed scheme achieves significant model size reduction.
It maintains high inference accuracy after compression.
Compared to existing methods, it shows superior sparsity and performance trade-offs.
Abstract
Despite the growing availability of high-capacity computational platforms, implementation complexity still has been a great concern for the real-world deployment of neural networks. This concern is not exclusively due to the huge costs of state-of-the-art network architectures, but also due to the recent push towards edge intelligence and the use of neural networks in embedded applications. In this context, network compression techniques have been gaining interest due to their ability for reducing deployment costs while keeping inference accuracy at satisfactory levels. The present paper is dedicated to the development of a novel compression scheme for neural networks. To this end, a new form of -norm-based regularization is firstly developed, which is capable of inducing strong sparseness in the network during training. Then, targeting the smaller weights of the trained network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Advanced Neural Network Applications
MethodsPruning
