ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation

Yurun Song; Junchen Zhao; Ian G. Harris; Sangeetha Abdu Jyothi

arXiv:2406.10785·cs.CL·May 20, 2025

ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation

Yurun Song, Junchen Zhao, Ian G. Harris, Sangeetha Abdu Jyothi

PDF

Open Access 1 Repo

TL;DR

ShareLoRA is a novel fine-tuning method for large language models that significantly reduces trainable parameters and memory usage by sharing low-rank matrices, while maintaining or improving performance and robustness across diverse tasks and models.

Contribution

ShareLoRA introduces a shared low-rank adaptation technique that enhances parameter efficiency and robustness in LLM fine-tuning, outperforming standard LoRA in various scenarios.

Findings

01

Reduces trainable parameters by 44-96%

02

Maintains or improves model performance across tasks

03

Enhances robustness and generalization in diverse models

Abstract

In this paper, we introduce \textbf{Share}d \textbf{Lo}w \textbf{R}ank \textbf{A}daptation (ShareLoRA), a Large Language Model (LLM) fine-tuning technique that balances parameter efficiency, adaptability, and robustness without compromising performance. By strategically sharing the low-rank weight matrices across different layers, ShareLoRA achieves 44\% to 96\% reduction in trainable parameters compared to standard LoRA, alongside a substantial decrease in memory overhead. This efficiency gain scales with model size, making ShareLoRA particularly advantageous for resource-constrained environments. Importantly, ShareLoRA not only maintains model performance but also exhibits robustness in both classification and generation tasks across diverse models, including RoBERTa, GPT-2, and LLaMA series (1, 2, and 3). It consistently outperforms LoRA in zero-shot, few-shot, and continual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Rain9876/ShareLoRA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · WordPiece · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Dropout