Expanding Sparse Tuning for Low Memory Usage

Shufan Shen; Junshu Sun; Xiangyang Ji; Qingming Huang; Shuhui Wang

arXiv:2411.01800 · cs.CV · November 5, 2024

TL;DR

SNELL introduces a low-memory sparse tuning method for vision models by decomposing matrices into low-rank forms and applying nonlinear kernels, achieving state-of-the-art results efficiently.

Contribution

The paper proposes SNELL, a novel sparse tuning approach that reduces memory usage by matrix decomposition and nonlinear merging, enabling effective large-scale model adaptation.
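To make the memory argument concrete, the following back-of-the-envelope sketch compares the number of learnable parameters when a full weight matrix is kept as learnable against two low-rank factors. The layer size (768 × 768) and rank (8) are illustrative assumptions, not values taken from the paper, and the function names are hypothetical.

```python
def learnable_params_full(d_out: int, d_in: int) -> int:
    """Learnable entries when the whole d_out x d_in matrix is tunable."""
    return d_out * d_in


def learnable_params_low_rank(d_out: int, d_in: int, rank: int) -> int:
    """Learnable entries for two factors of shape (d_out, rank) and (d_in, rank)."""
    return rank * (d_out + d_in)


# Assumed sizes for illustration only (not from the paper).
d_out, d_in, rank = 768, 768, 8

full = learnable_params_full(d_out, d_in)
low_rank = learnable_params_low_rank(d_out, d_in, rank)

print(f"full matrix : {full} learnable parameters")
print(f"low-rank    : {low_rank} learnable parameters")
print(f"reduction   : {full / low_rank:.1f}x")
```

Optimizers such as Adam keep additional per-parameter state, so a smaller set of learnable parameters also shrinks the optimizer's footprint.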

Findings

01. SNELL achieves state-of-the-art performance on multiple tasks.

02. It significantly reduces memory usage compared to existing sparse tuning methods.

03. The method effectively adapts large pre-trained models to downstream tasks.

Abstract

Parameter-efficient fine-tuning (PEFT) is an effective method for adapting pre-trained vision models to downstream tasks by tuning a small subset of parameters. Among PEFT methods, sparse tuning achieves superior performance by only adjusting the weights most relevant to downstream tasks, rather than densely tuning the whole weight matrix. However, this performance improvement has been accompanied by increases in memory usage, which stems from two factors, i.e., the storage of the whole weight matrix as learnable parameters in the optimizer and the additional storage of tunable weight indexes. In this paper, we propose a method named SNELL (Sparse tuning with kerNELized LoRA) for sparse tuning with low memory usage. To achieve low memory usage, SNELL decomposes the tunable matrix for sparsification into two learnable low-rank matrices, saving from the costly storage of the whole…
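The idea in the abstract (two learnable low-rank matrices, merged nonlinearly, then sparsified) can be sketched roughly as follows. This is a minimal NumPy illustration under stated assumptions, not the authors' implementation: the polynomial kernel, the magnitude-based top-k sparsification rule, and all names (`kernel_merge`, `sparsify_top_k`, `keep_ratio`) are hypothetical choices made here for illustration. The official repository should be consulted for the actual method.

```python
import numpy as np


def kernel_merge(B: np.ndarray, A: np.ndarray, degree: int = 3) -> np.ndarray:
    """Merge two low-rank factors with a nonlinear kernel.

    ASSUMPTION: a polynomial kernel k(b, a) = (b . a + 1) ** degree is used
    purely as an example; the paper's kernel may differ.
    B has shape (d_out, r), A has shape (d_in, r); result is (d_out, d_in).
    """
    return (B @ A.T + 1.0) ** degree


def sparsify_top_k(delta: np.ndarray, keep_ratio: float) -> np.ndarray:
    """Keep the largest-magnitude entries and zero out the rest.

    ASSUMPTION: magnitude-based selection is an illustrative rule only.
    """
    k = max(1, int(round(keep_ratio * delta.size)))
    threshold = np.sort(np.abs(delta), axis=None)[-k]
    return np.where(np.abs(delta) >= threshold, delta, 0.0)


rng = np.random.default_rng(0)
d_out, d_in, r = 6, 4, 2              # toy sizes, chosen for readability

W0 = rng.normal(size=(d_out, d_in))   # frozen pre-trained weight (stand-in)
B = rng.normal(size=(d_out, r))       # learnable low-rank factor
A = rng.normal(size=(d_in, r))        # learnable low-rank factor

linear_delta = B @ A.T                # plain LoRA-style merge, rank <= r
dense_delta = kernel_merge(B, A)      # nonlinear merge of the same factors
sparse_delta = sparsify_top_k(dense_delta, keep_ratio=0.25)

W_adapted = W0 + sparse_delta         # only the selected entries change
print("non-zero updates:", np.count_nonzero(sparse_delta), "of", sparse_delta.size)
```

Only `B` and `A` would be stored as learnable parameters in this sketch; the dense merged matrix is recomputed from them rather than kept in the optimizer. A nonlinear merge can also yield a matrix whose rank is not capped at `r`, unlike the plain product `B @ A.T`.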

Code & Models

Repositories

ssfgunner/snell (PyTorch, official)

Taxonomy

Topics: Parallel Computing and Optimization Techniques