Structure-Preserving Network Compression Via Low-Rank Induced Training   Through Linear Layers Composition

Xitong Zhang; Ismail R. Alkhouri; Rongrong Wang

arXiv:2405.03089·cs.LG·October 18, 2024

Structure-Preserving Network Compression Via Low-Rank Induced Training Through Linear Layers Composition

Xitong Zhang, Ismail R. Alkhouri, Rongrong Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces LoRITa, a training method that promotes low-rank structures in neural networks through layer composition and singular value truncation, enabling effective compression without structural changes at inference.

Contribution

LoRITa is a novel low-rank training technique that does not require pre-trained models, rank pre-specification, or iterative SVD, simplifying network compression.

Findings

01

Achieves competitive or state-of-the-art compression results on multiple datasets.

02

Does not require pre-training or rank pre-specification.

03

Effective across various neural network architectures.

Abstract

Deep Neural Networks (DNNs) have achieved remarkable success in addressing many previously unsolvable tasks. However, the storage and computational requirements associated with DNNs pose a challenge for deploying these trained models on resource-limited devices. Therefore, a plethora of compression and pruning techniques have been proposed in recent years. Low-rank decomposition techniques are among the approaches most utilized to address this problem. Compared to post-training compression, compression-promoted training is still under-explored. In this paper, we present a theoretically-justified technique termed Low-Rank Induced Training (LoRITa), that promotes low-rankness through the composition of linear layers and compresses by using singular value truncation. This is achieved without the need to change the structure at inference time or require constrained and/or additional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xitongsystem/lorita
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Neural Networks and Reservoir Computing · Neural Networks and Applications

MethodsPruning · Weight Decay