NLoRA: Nystr\"om-Initiated Low-Rank Adaptation for Large Language Models
Chenlu Guo, Yuan Wu, Yi Chang

TL;DR
This paper introduces Nystr"om-Initiated Low-Rank Adaptation (NLoRA) and related methods to improve the efficiency and convergence of parameter-efficient fine-tuning of large language models, achieving better performance with fewer parameters.
Contribution
The paper proposes NLoRA, SLoRA, and IntTune, novel methods that leverage the Nystr"om method and intermediate matrices to enhance LoRA's efficiency and effectiveness in LLM fine-tuning.
Findings
NLoRA outperforms LoRA with 36.41% higher accuracy on GSM8K.
IntTune improves NLG performance by 7.45% using only 1.25% of LoRA's parameters.
Methods achieve significant accuracy gains with minimal additional trainable parameters.
Abstract
Parameter-efficient fine-tuning (PEFT) is essential for adapting large language models (LLMs), with low-rank adaptation (LoRA) being the most popular approach. However, LoRA suffers from slow convergence, and some recent LoRA variants, such as PiSSA, primarily rely on Singular Value Decomposition (SVD) for initialization, leading to expensive computation. To mitigate these problems, we use the Nystr\"om method, which follows a three-matrix manipulation. We first introduce StructuredLoRA (SLoRA), which investigates adding a small intermediate matrix between the low-rank matrices A and B. Secondly, we propose Nystr\"omLoRA (NLoRA), which leverages Nystr\"om-based initialization for SLoRA to improve its effectiveness and efficiency. Finally, we propose IntermediateTune (IntTune), which explores fine-tuning exclusively on the intermediate matrix of NLoRA to further boost LLM efficiency. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis
