Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition
Xitong Zhang, Ismail R. Alkhouri, Rongrong Wang

TL;DR
This paper introduces LoRITa, a training method that promotes low-rank structures in neural networks through layer composition and singular value truncation, enabling effective compression without structural changes at inference.
Contribution
LoRITa is a novel low-rank training technique that does not require pre-trained models, rank pre-specification, or iterative SVD, simplifying network compression.
Findings
Achieves competitive or state-of-the-art compression results on multiple datasets.
Does not require pre-training or rank pre-specification.
Effective across various neural network architectures.
Abstract
Deep Neural Networks (DNNs) have achieved remarkable success in addressing many previously unsolvable tasks. However, the storage and computational requirements associated with DNNs pose a challenge for deploying these trained models on resource-limited devices. Therefore, a plethora of compression and pruning techniques have been proposed in recent years. Low-rank decomposition techniques are among the approaches most utilized to address this problem. Compared to post-training compression, compression-promoted training is still under-explored. In this paper, we present a theoretically-justified technique termed Low-Rank Induced Training (LoRITa), that promotes low-rankness through the composition of linear layers and compresses by using singular value truncation. This is achieved without the need to change the structure at inference time or require constrained and/or additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Neural Networks and Reservoir Computing · Neural Networks and Applications
MethodsPruning · Weight Decay
