Smooth Model Compression without Fine-Tuning
Christina Runkel, Natacha Kuete Meli, Jovita Lukasik, Ander Biguri, Carola-Bibiane Sch\"onlieb, Michael Moeller

TL;DR
This paper introduces a smooth regularization technique during training that enhances model compression efficiency, enabling high accuracy with fewer parameters without the need for fine-tuning.
Contribution
The authors propose a novel smooth regularization method using nuclear norm and derivative penalties, improving structured model compression without fine-tuning.
Findings
Standard pruning performs better on smooth models.
The SVD-based compression achieves up to 91% accuracy with 70% fewer parameters.
The approach outperforms traditional methods without fine-tuning.
Abstract
Compressing and pruning large machine learning models has become a critical step towards their deployment in real-world applications. Standard pruning and compression techniques are typically designed without taking the structure of the network's weights into account, limiting their effectiveness. We explore the impact of smooth regularization on neural network training and model compression. By applying nuclear norm, first- and second-order derivative penalties of the weights during training, we encourage structured smoothness while preserving predictive performance on par with non-smooth models. We find that standard pruning methods often perform better when applied to these smooth models. Building on this observation, we apply a Singular-Value-Decomposition-based compression method that exploits the underlying smooth structure and approximates the model's weight tensors by smaller…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · Embedded Systems Design Techniques
MethodsPruning
