DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
Haojie Zhang

TL;DR
DropLoRA introduces a pruning-based method that dynamically adapts the low-rank subspace in PEFT, significantly improving performance over traditional LoRA in various large language model tasks without extra costs.
Contribution
It proposes DropLoRA, a novel dynamic low-rank adaptation method that overcomes the static subspace limitation of LoRA, enhancing fine-tuning performance.
Findings
DropLoRA outperforms LoRA across multiple tasks.
It improves performance without additional training costs.
Effective on LLaMA models for diverse NLP tasks.
Abstract
LoRA-based large model parameter-efficient fine-tuning (PEFT) methods use low-rank de- composition to approximate updates to model parameters. However, compared to full- parameter fine-tuning, low-rank updates often lead to a performance gap in downstream tasks. To address this, we introduce DropLoRA, a novel pruning-based approach that focuses on pruning the rank dimension. Unlike conven- tional methods that attempt to overcome the low-rank bottleneck, DropLoRA innovatively integrates a pruning module between the two low-rank matrices in LoRA to simulate dy- namic subspace learning. This dynamic low- rank subspace learning allows DropLoRA to overcome the limitations of traditional LoRA, which operates within a static subspace. By continuously adapting the learning subspace, DropLoRA significantly boosts performance without incurring additional training or infer- ence costs. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
