A flexible, extensible software framework for model compression based on the LC algorithm
Yerlan Idelbayev, Miguel \'A. Carreira-Perpi\~n\'an

TL;DR
This paper introduces a flexible software framework based on the Learning-Compression algorithm that simplifies applying various compression techniques to neural networks, maintaining high performance and extensibility.
Contribution
It presents a modular, extensible Python/PyTorch library implementing the LC algorithm for diverse model compression schemes with minimal effort.
Findings
Supports multiple compression schemes like pruning, quantization, and low-rank methods
Compression runtime comparable to training time
Achieves competitive accuracy and compression ratios
Abstract
We propose a software framework based on the ideas of the Learning-Compression (LC) algorithm, that allows a user to compress a neural network or other machine learning model using different compression schemes with minimal effort. Currently, the supported compressions include pruning, quantization, low-rank methods (including automatically learning the layer ranks), and combinations of those, and the user can choose different compression types for different parts of a neural network. The LC algorithm alternates two types of steps until convergence: a learning (L) step, which trains a model on a dataset (using an algorithm such as SGD); and a compression (C) step, which compresses the model parameters (using a compression scheme such as low-rank or quantization). This decoupling of the "machine learning" aspect from the "signal compression" aspect means that changing the model or the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Blind Source Separation Techniques · Advanced Data Compression Techniques
