NLoRA: Nystr\"om-Initiated Low-Rank Adaptation for Large Language Models

Chenlu Guo; Yuan Wu; Yi Chang

arXiv:2502.14482·cs.CL·February 21, 2025

NLoRA: Nystr\"om-Initiated Low-Rank Adaptation for Large Language Models

Chenlu Guo, Yuan Wu, Yi Chang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Nystr"om-Initiated Low-Rank Adaptation (NLoRA) and related methods to improve the efficiency and convergence of parameter-efficient fine-tuning of large language models, achieving better performance with fewer parameters.

Contribution

The paper proposes NLoRA, SLoRA, and IntTune, novel methods that leverage the Nystr"om method and intermediate matrices to enhance LoRA's efficiency and effectiveness in LLM fine-tuning.

Findings

01

NLoRA outperforms LoRA with 36.41% higher accuracy on GSM8K.

02

IntTune improves NLG performance by 7.45% using only 1.25% of LoRA's parameters.

03

Methods achieve significant accuracy gains with minimal additional trainable parameters.

Abstract

Parameter-efficient fine-tuning (PEFT) is essential for adapting large language models (LLMs), with low-rank adaptation (LoRA) being the most popular approach. However, LoRA suffers from slow convergence, and some recent LoRA variants, such as PiSSA, primarily rely on Singular Value Decomposition (SVD) for initialization, leading to expensive computation. To mitigate these problems, we use the Nystr\"om method, which follows a three-matrix manipulation. We first introduce StructuredLoRA (SLoRA), which investigates adding a small intermediate matrix between the low-rank matrices A and B. Secondly, we propose Nystr\"omLoRA (NLoRA), which leverages Nystr\"om-based initialization for SLoRA to improve its effectiveness and efficiency. Finally, we propose IntermediateTune (IntTune), which explores fine-tuning exclusively on the intermediate matrix of NLoRA to further boost LLM efficiency. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TracyGuo2001/NLoRA
pytorchOfficial

Videos

NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis