TL;DR
This paper introduces ENMP, an evolutionary pruning method that identifies and removes negative LoRA modules to improve the merging of multiple LoRA experts, enhancing multi-task model performance.
Contribution
ENMP is a novel evolutionary search-based pruning technique that effectively excludes detrimental modules before merging, leading to state-of-the-art results in language and vision tasks.
Findings
ENMP consistently improves merging performance across tasks.
It achieves state-of-the-art results in language and vision domains.
The method effectively identifies negative modules that degrade performance.
Abstract
Merging multiple Low-Rank Adaptation (LoRA) experts into a single backbone is a promising approach for efficient multi-task deployment. While existing methods strive to alleviate interference via weight interpolation or subspace alignment, they rest upon the implicit assumption that all LoRA matrices contribute constructively to the merged model. In this paper, we uncover a critical bottleneck in current merging paradigms: the existence of -- specific LoRA layers that inherently degrade global performance upon merging. We propose volutionary egative odule runing (), a plug-and-play LoRA pruning method to locate and exclude these detrimental modules prior to merging. By leveraging an evolutionary search strategy, ENMP effectively navigates the discrete, non-differentiable landscape of module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
