LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules
Ivan Vuli\'c, Adam Grycner, Quentin de Laroussilhe, Jonas Pfeiffer

TL;DR
LoRA-Squeeze enhances parameter-efficient fine-tuning by learning a high-rank solution first and then compressing it via RSVD, resulting in better performance and flexibility across diverse tasks.
Contribution
It introduces a simple, effective post-hoc and in-tuning rank compression method for LoRA modules, improving adaptability and performance.
Findings
Post-hoc compression yields lower-rank adapters that outperform direct training at the target rank.
Gradual in-tuning rank annealing achieves optimal size-performance trade-offs.
Method is effective across multiple text and vision-language tasks.
Abstract
Despite its huge number of variants, standard Low-Rank Adaptation (LoRA) is still a dominant technique for parameter-efficient fine-tuning (PEFT). Nonetheless, it faces persistent challenges, including the pre-selection of an optimal rank and rank-specific hyper-parameters, as well as the deployment complexity of heterogeneous-rank modules and more sophisticated LoRA derivatives. In this work, we introduce LoRA-Squeeze, a simple and efficient methodology that aims to improve standard LoRA learning by changing LoRA module ranks either post-hoc or dynamically during training}. Our approach posits that it is better to first learn an expressive, higher-rank solution and then compress it, rather than learning a constrained, low-rank solution directly. The method involves fine-tuning with a deliberately high(er) source rank, reconstructing or efficiently approximating the reconstruction of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques
