LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules

Ivan Vuli\'c; Adam Grycner; Quentin de Laroussilhe; Jonas Pfeiffer

arXiv:2602.10993·cs.CL·February 20, 2026

LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules

Ivan Vuli\'c, Adam Grycner, Quentin de Laroussilhe, Jonas Pfeiffer

PDF

Open Access

TL;DR

LoRA-Squeeze enhances parameter-efficient fine-tuning by learning a high-rank solution first and then compressing it via RSVD, resulting in better performance and flexibility across diverse tasks.

Contribution

It introduces a simple, effective post-hoc and in-tuning rank compression method for LoRA modules, improving adaptability and performance.

Findings

01

Post-hoc compression yields lower-rank adapters that outperform direct training at the target rank.

02

Gradual in-tuning rank annealing achieves optimal size-performance trade-offs.

03

Method is effective across multiple text and vision-language tasks.

Abstract

Despite its huge number of variants, standard Low-Rank Adaptation (LoRA) is still a dominant technique for parameter-efficient fine-tuning (PEFT). Nonetheless, it faces persistent challenges, including the pre-selection of an optimal rank and rank-specific hyper-parameters, as well as the deployment complexity of heterogeneous-rank modules and more sophisticated LoRA derivatives. In this work, we introduce LoRA-Squeeze, a simple and efficient methodology that aims to improve standard LoRA learning by changing LoRA module ranks either post-hoc or dynamically during training}. Our approach posits that it is better to first learn an expressive, higher-rank solution and then compress it, rather than learning a constrained, low-rank solution directly. The method involves fine-tuning with a deliberately high(er) source rank, reconstructing or efficiently approximating the reconstruction of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques