Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging

Shenghe Zheng; Hongzhi Wang; Chenyu Huang; Xiaohui Wang; Tao Chen; Jiayuan Fan; Shuyue Hu; Peng Ye

arXiv:2505.15875·cs.CV·May 23, 2025

Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging

Shenghe Zheng, Hongzhi Wang, Chenyu Huang, Xiaohui Wang, Tao Chen, Jiayuan Fan, Shuyue Hu, Peng Ye

PDF

Open Access

TL;DR

This paper introduces DO-Merging, a novel data-free framework that decouples and orthogonalizes parameters to improve model merging performance, especially for LoRA modules, across various domains.

Contribution

It proposes a decoupled and orthogonal merging approach that addresses magnitude variance issues in LoRA, with theoretical guarantees and broad applicability.

Findings

01

Significantly higher performance than existing methods.

02

Effective across vision, language, and multi-modal tasks.

03

Minimal additional computational cost.

Abstract

With more open-source models available for diverse tasks, model merging has gained attention by combining models into one, reducing training, storage, and inference costs. Current research mainly focuses on model merging for full fine-tuning, overlooking the popular LoRA. However, our empirical analysis reveals that: a) existing merging methods designed for full fine-tuning perform poorly on LoRA; b) LoRA modules show much larger parameter magnitude variance than full fine-tuned weights; c) greater parameter magnitude variance correlates with worse merging performance. Considering that large magnitude variances cause deviations in the distribution of the merged parameters, resulting in information loss and performance degradation, we propose a Decoupled and Orthogonal merging approach(DO-Merging). By separating parameters into magnitude and direction components and merging them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsSoftmax · Attention Is All You Need