Singular Value Decomposition on Kronecker Adaptation for Large Language Model

Yee Hin Chong; Peng Qu

arXiv:2506.15251·cs.LG·June 19, 2025

Singular Value Decomposition on Kronecker Adaptation for Large Language Model

Yee Hin Chong, Peng Qu

PDF

Open Access

TL;DR

This paper introduces SoKA, a parameter-efficient fine-tuning method for large language models that combines Kronecker-product tensor factorization with SVD and dynamic rank selection, achieving comparable or better performance with fewer parameters.

Contribution

We propose SoKA, a novel PEFT approach that integrates Kronecker-product SVD and adaptive rank pruning for efficient large model adaptation.

Findings

01

Requires only 0.99M trainable parameters, 25% fewer than LoRA/PiSSA.

02

Matches or exceeds baseline performance across multiple tasks.

03

Demonstrates faster convergence and more stable gradients.

Abstract

Large pre-trained Transformer models achieve state-of-the-art results across diverse language and reasoning tasks, but full fine-tuning incurs substantial storage, memory, and computational overhead. Parameter-efficient fine-tuning (PEFT) methods mitigate these costs by learning only a small subset of task-specific parameters, yet existing approaches either introduce inference-time latency (adapter modules), suffer from suboptimal convergence (randomly initialized low-rank updates), or rely on fixed rank choices that may not match task complexity (Kronecker-based decompositions). We propose SoKA (SVD on Kronecker Adaptation), a novel PEFT strategy that combines Kronecker-product tensor factorization with SVD-driven initialization and spectrum-aware dynamic rank selection. Our Kronecker-Product SVD (KPSVD) procedure extracts principal components of the full weight update into compact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling