Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression
Lin Xv, Xian Gao, Ting Li, Yuzhuo Fu

TL;DR
Duo-SVD introduces a dual-level, training-free optimization framework for LLM compression that selectively applies low-rank approximation based on component importance, outperforming existing methods.
Contribution
The paper proposes Duo-SVD, a novel dual-level optimization approach that considers component-specific errors and importance, improving LLM compression beyond traditional SVD techniques.
Findings
Duo-SVD outperforms state-of-the-art SVD-based and pruning methods.
The framework effectively retains critical components while compressing less important ones.
Experimental results show significant improvements in model efficiency and accuracy.
Abstract
Low-rank decomposition, particularly Singular Value Decomposition (SVD), is a pivotal technique for mitigating the storage and computational demands of Large Language Models (LLMs). However, prevalent SVD-based approaches overlook the critical phenomenon that decomposition errors exhibit significant disparity across different components of the parameter matrix, often leading to suboptimal approximation. Furthermore, existing methods lack a direct metric to evaluate the importance of individual weight matrices. To address these limitations, we propose Duo-SVD (Dual-level Optimization SVD), a novel training-free framework that synergizes optimization at both the column and the module levels. First, Duo-SVD incorporates a Column-Preserving Strategy that explicitly retains columns exhibiting high decomposition errors, while applying low-rank approximation solely to those with lower errors.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Big Data and Digital Economy
