DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic,   Lightweight Plugin for Large Language Models

Yuxuan Zhang; Ruizhe Li

arXiv:2410.01497·cs.CL·February 19, 2025

DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models

Yuxuan Zhang, Ruizhe Li

PDF

Open Access 1 Repo

TL;DR

DLP-LoRA introduces a dynamic, lightweight plugin for large language models that efficiently fuses multiple LoRAs at the sentence level, significantly improving multi-task performance with minimal inference overhead.

Contribution

The paper presents DLP-LoRA, a novel dynamic fusion method for multiple LoRAs using a small MLP, enabling efficient multi-task adaptation with reduced inference time.

Findings

01

Achieves 92.34% accuracy on multiple-choice tasks

02

Improves BLEU and ROUGE scores on QA datasets

03

Maintains inference time less than twice that of single LoRA

Abstract

Recent advancements in Large Language Models (LLMs) have achieved robust performance across diverse tasks, but fine-tuning these models for specific domains remains resource-intensive. Parameter-Efficient Fine-Tuning (PEFT) methods like Low-Rank Adaptation (LoRA) address this challenge by fine-tuning a small subset of parameters. However, existing methods for fusing multiple LoRAs lack dynamic fusion based on contextual inputs and often increase inference time due to token-level operations. We propose DLP-LoRA, a Dynamic Lightweight Plugin that employs a mini-MLP module with only 5M parameters to dynamically fuse multiple LoRAs at the sentence level using top-p sampling strategies. This approach reduces inference time to less than twice that of single LoRA inference by leveraging parallel computation. Evaluations across 26 tasks-including multiple-choice questions and question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mecuping/dlp-lora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis