VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

Yang Li; Shaobo Han; Shihao Ji

arXiv:2405.15179·cs.CL·October 30, 2024·1 cites

VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks

Yang Li, Shaobo Han, Shihao Ji

PDF

Open Access 1 Repo 1 Video

TL;DR

VB-LoRA introduces a vector bank-based paradigm for parameter-efficient fine-tuning of large language models, drastically reducing storage costs while maintaining or improving performance across various NLP tasks.

Contribution

It proposes a novel 'divide-and-share' approach that shares parameters globally via a vector bank, enabling extreme parameter efficiency in PEFT methods.

Findings

01

VB-LoRA uses only 0.4% of LoRA's parameters on Llama2-13B.

02

Achieves comparable or better performance than state-of-the-art PEFT methods.

03

Effective across diverse NLP tasks including understanding, generation, and reasoning.

Abstract

As the adoption of large language models increases and the need for per-user or per-task model customization grows, the parameter-efficient fine-tuning (PEFT) methods, such as low-rank adaptation (LoRA) and its variants, incur substantial storage and transmission costs. To further reduce stored parameters, we introduce a "divide-and-share" paradigm that breaks the barriers of low-rank decomposition across matrix dimensions, modules, and layers by sharing parameters globally via a vector bank. As an instantiation of the paradigm to LoRA, our proposed VB-LoRA composites all the low-rank matrices of LoRA from a shared vector bank with a differentiable top-k admixture module. VB-LoRA achieves extreme parameter efficiency while maintaining comparable or better performance compared to state-of-the-art PEFT methods. Extensive experiments demonstrate the effectiveness of VB-LoRA on natural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leo-yangli/vb-lora
jaxOfficial

Videos

VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks· slideslive

Taxonomy

TopicsDigital Filter Design and Implementation · Advanced Wireless Communication Techniques · Advanced Data Compression Techniques