From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
Ajay Jaiswal, Yifan Wang, Lu Yin, Shiwei Liu, Runjin Chen, Jiawei Zhao, Ananth Grama, Yuandong Tian, Zhangyang Wang

TL;DR
This paper introduces a theoretical and empirical study of low-rank structures in LLM weights, leading to a unified low-rank weight compression and fine-tuning method called WeLore that reduces memory and compute needs.
Contribution
It provides a new theoretical framework linking gradient dynamics to low-rank properties of LLM weights and proposes a novel one-shot low-rank weight compression method, WeLore.
Findings
Different LLM components have varying low-rank structures.
Variable rank reduction minimizes performance drop during compression.
WeLore achieves memory-efficient fine-tuning with comparable or better performance.
Abstract
Large Language Models' (LLMs) weight matrices can often be expressed in low-rank form with potential to relax memory and compute resource requirements. Unlike prior efforts that focus on developing novel matrix decompositions, in this work we study the non-uniform low-rank properties of weight matrices in LLMs through the lens of stabilizing gradient subspace. First, we provide a theoretical framework to understand the stabilization of gradient subspaces through Hessian analysis. Second, we empirically establish an important relationship between gradient dynamics and low-rank expressiveness of weight matrices. Our findings reveal that different LLM components exhibit varying levels of converged low-rank structures, necessitating variable rank reduction across them to minimize drop in performance due to compression. Drawing on this result, we present Weight Low-Rank Projection(WeLore)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGa2O3 and related materials · Plasma Diagnostics and Applications · High voltage insulation and dielectric phenomena
MethodsFocus
