Optimizing Singular Spectrum for Large Language Model Compression
Dengjie Li, Tiancheng Shen, Yao Zhou, Baisong Yang, Zhongying Liu,, Masheng Yang, Bernard Ghanem, Yibo Yang, Yujie Zhong, Ming-Hsuan Yang

TL;DR
This paper introduces SoCo, a data-driven singular spectrum optimization framework for large language model compression, which adaptively prunes components based on learned importance scores to better preserve performance.
Contribution
SoCo employs a learnable diagonal matrix and a three-stage training process to refine importance scores, enabling adaptive pruning and improved compression of LLMs.
Findings
Outperforms state-of-the-art compression methods on multiple benchmarks.
Effectively balances model size reduction with performance retention.
Adaptive pruning based on learned importance scores enhances compression efficiency.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities, yet prohibitive parameter complexity often hinders their deployment. Existing singular value decomposition (SVD) based compression methods simply deem singular values as importance scores of decomposed components. However, this importance ordered by singular values does not necessarily correlate with the performance of a downstream task. In this work, we introduce SoCo (Singular spectrum optimization for large language model Compression), a novel compression framework that learns to rescale the decomposed components of SVD in a data-driven manner. Concretely, we employ a learnable diagonal matrix to assign importance scores for singular spectrum and develop a three-stage training process that progressively refines these scores from initial coarse compression to fine-grained sparsification-thereby striking an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
