A Partial Regularization Method for Network Compression
E Zhenqian, Gao Weiguo

TL;DR
This paper introduces a partial regularization technique for neural network compression that speeds up training and improves performance by selectively regularizing parameters, leveraging permutation invariance.
Contribution
It proposes a novel partial regularization approach for faster, more effective network compression, differing from traditional full regularization methods.
Findings
Reduced computational complexity and faster training times.
Improved accuracy and generalization in pruned models.
Existence of an optimal network structure depending on input data.
Abstract
Deep Neural Networks have achieved remarkable success relying on the developing availability of GPUs and large-scale datasets with increasing network depth and width. However, due to the expensive computation and intensive memory, researchers have concentrated on designing compression methods in order to make them practical for constrained platforms. In this paper, we propose an approach of partial regularization rather than the original form of penalizing all parameters, which is said to be full regularization, to conduct model compression at a higher speed. It is reasonable and feasible according to the existence of the permutation invariant property of neural networks. Experimental results show that as we expected, the computational complexity is reduced by observing less running time in almost all situations. It should be owing to the fact that partial regularization method invovles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Advanced Neural Network Applications
