Importance-Guided Basis Selection for Low-Rank Decomposition of Large Language Models

Daniel Agyei Asante; Ernie Chang; Yang Li

arXiv:2605.01627·cs.LG·May 8, 2026

Importance-Guided Basis Selection for Low-Rank Decomposition of Large Language Models

Daniel Agyei Asante, Ernie Chang, Yang Li

PDF

TL;DR

This paper introduces BSI, a principled importance-based basis selection method for low-rank compression of large language models, leveraging second-order loss curvature to improve pruning decisions.

Contribution

BSI provides a theoretically grounded, efficient framework for basis pruning in LLMs using a second-order Taylor expansion and a novel Hessian-diagonal estimator.

Findings

01

BSI outperforms existing low-rank decomposition methods on reasoning benchmarks.

02

BSI achieves stronger compression with minimal loss in performance.

03

Theoretical analysis guarantees estimation accuracy and bounds on loss increase.

Abstract

Low-rank decomposition is a compelling approach for compressing large language models, but its effectiveness hinges on selecting which singular-vector bases to retain for a target task. Existing methods such as Basel adapt singular-value coefficients on downstream data and prune bases with small re-learned magnitudes, a heuristic that can be misaligned with task performance because it ignores the local geometry of the loss landscape. We present Basis Selection with Importance (BSI), a principled low-rank compression framework that ranks and prunes bases by directly estimating the expected loss increase incurred when each basis is removed. BSI derives a derivative-based importance score from a second-order Taylor expansion of the task loss with respect to singular values, combining first-order sensitivity and second-order curvature to quantify pruning impact. To make this criterion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.