Sparsity May Be All You Need: Sparse Random Parameter Adaptation

Jesus Rios; Pierre Dognin; Ronny Luss; Karthikeyan N. Ramamurthy

arXiv:2502.15975·cs.CL·September 22, 2025

Sparsity May Be All You Need: Sparse Random Parameter Adaptation

Jesus Rios, Pierre Dognin, Ronny Luss, Karthikeyan N. Ramamurthy

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper shows that randomly selecting a small subset of model parameters to fine-tune can be as effective as structured PEFT methods like LoRA, emphasizing the importance of the number of trainable parameters over their specific arrangement.

Contribution

Introducing a simple random parameter selection method for PEFT that challenges the necessity of structured adapters like LoRA, highlighting the role of parameter count in performance.

Findings

01

Random parameter selection is competitive with LoRA.

02

Number of trainable parameters is crucial for PEFT success.

03

Structured adapters may be less important than parameter quantity.

Abstract

Full fine-tuning of large language models for alignment and task adaptation has become prohibitively expensive as models have grown in size. Parameter-Efficient Fine-Tuning (PEFT) methods aim at significantly reducing the computational and memory resources needed for fine-tuning these models by only training on a small number of parameters instead of all model parameters. Currently, the most popular PEFT method is the Low-Rank Adaptation (LoRA), which freezes the parameters of the model and introduces a small set of trainable parameters in the form of low-rank matrices. We propose simply reducing the number of trainable parameters by randomly selecting a small proportion of the model parameters to train on, while fixing all other parameters, without any additional prior assumptions such as low-rank structures. In this paper, we compare the efficiency and performance of our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/SpaRTA
pytorchOfficial

Videos

Sparsity May Be All You Need: Sparse Random Parameter Adaptation· underline

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Stochastic Gradient Optimization Techniques

MethodsSparse Evolutionary Training