Low-Rank Prehab: Preparing Neural Networks for SVD Compression

Haoran Qin; Shansita Sharma; Ali Abbasi; Chayne Thrash; Soheil Kolouri

arXiv:2512.01980·cs.LG·December 2, 2025

Low-Rank Prehab: Preparing Neural Networks for SVD Compression

Haoran Qin, Shansita Sharma, Ali Abbasi, Chayne Thrash, Soheil Kolouri

PDF

Open Access

TL;DR

Low-Rank Prehab is a novel fine-tuning approach that prepares neural networks for more effective low-rank SVD compression, reducing accuracy loss and enhancing post-compression performance.

Contribution

We introduce Low-Rank Prehab, a pre-compression fine-tuning method that encourages low-rank structure in weights, improving SVD-based compression outcomes for neural networks.

Findings

01

Prehab reduces accuracy drop after compression.

02

Prehab improves post-finetuning performance.

03

Outperforms state-of-the-art SVD techniques across models.

Abstract

Low-rank approximation methods such as singular value decomposition (SVD) and its variants (e.g., Fisher-weighted SVD, Activation SVD) have recently emerged as effective tools for neural network compression. In this setting, decomposition acts as a "surgical" intervention, followed by fine-tuning that serves as "rehab" to recover accuracy. Inspired by prehabilitation in surgery, we introduce a pre-compression fine-tuning stage, Low-Rank Prehab, that explicitly encourages low-rank structure in weight matrices while preserving task performance. By conditioning the model before SVD, Prehab steers weights toward spectrally compact regions of the parameter space, enabling smoother low-rank approximation and improved recovery. Experiments on large language models (LLMs) and other Transformer-based architectures, including Vision Transformers (ViTs), show that Prehab substantially reduces the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Neural Network Applications