Expanding Sparse Tuning for Low Memory Usage
Shufan Shen, Junshu Sun, Xiangyang Ji, Qingming Huang, Shuhui Wang

TL;DR
SNELL introduces a low-memory sparse tuning method for vision models by decomposing matrices into low-rank forms and applying nonlinear kernels, achieving state-of-the-art results efficiently.
Contribution
The paper proposes SNELL, a novel sparse tuning approach that reduces memory usage by matrix decomposition and nonlinear merging, enabling effective large-scale model adaptation.
Findings
SNELL achieves state-of-the-art performance on multiple tasks.
It significantly reduces memory usage compared to existing sparse tuning methods.
The method effectively adapts large pre-trained models to downstream tasks.
Abstract
Parameter-efficient fine-tuning (PEFT) is an effective method for adapting pre-trained vision models to downstream tasks by tuning a small subset of parameters. Among PEFT methods, sparse tuning achieves superior performance by only adjusting the weights most relevant to downstream tasks, rather than densely tuning the whole weight matrix. However, this performance improvement has been accompanied by increases in memory usage, which stems from two factors, i.e., the storage of the whole weight matrix as learnable parameters in the optimizer and the additional storage of tunable weight indexes. In this paper, we propose a method named SNELL (Sparse tuning with kerNELized LoRA) for sparse tuning with low memory usage. To achieve low memory usage, SNELL decomposes the tunable matrix for sparsification into two learnable low-rank matrices, saving from the costly storage of the whole…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques
