Singular Value Decomposition on Kronecker Adaptation for Large Language Model
Yee Hin Chong, Peng Qu

TL;DR
This paper introduces SoKA, a parameter-efficient fine-tuning method for large language models that combines Kronecker-product tensor factorization with SVD and dynamic rank selection, achieving comparable or better performance with fewer parameters.
Contribution
We propose SoKA, a novel PEFT approach that integrates Kronecker-product SVD and adaptive rank pruning for efficient large model adaptation.
Findings
Requires only 0.99M trainable parameters, 25% fewer than LoRA/PiSSA.
Matches or exceeds baseline performance across multiple tasks.
Demonstrates faster convergence and more stable gradients.
Abstract
Large pre-trained Transformer models achieve state-of-the-art results across diverse language and reasoning tasks, but full fine-tuning incurs substantial storage, memory, and computational overhead. Parameter-efficient fine-tuning (PEFT) methods mitigate these costs by learning only a small subset of task-specific parameters, yet existing approaches either introduce inference-time latency (adapter modules), suffer from suboptimal convergence (randomly initialized low-rank updates), or rely on fixed rank choices that may not match task complexity (Kronecker-based decompositions). We propose SoKA (SVD on Kronecker Adaptation), a novel PEFT strategy that combines Kronecker-product tensor factorization with SVD-driven initialization and spectrum-aware dynamic rank selection. Our Kronecker-Product SVD (KPSVD) procedure extracts principal components of the full weight update into compact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
