Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging

Haobo Zhang; Jiayu Zhou

arXiv:2505.22934·cs.CL·May 30, 2025

Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging

Haobo Zhang, Jiayu Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces OSRM, a method that constrains LoRA subspaces to improve multi-task model merging, reducing interference and maintaining accuracy across diverse datasets and models.

Contribution

It proposes a novel orthogonal subspace constraint for LoRA, enhancing robustness and compatibility of model merging without additional training.

Findings

01

Boosts merging performance across multiple datasets and models

02

Preserves single-task accuracy after merging

03

Increases robustness to hyperparameter variations

Abstract

Fine-tuning large language models (LMs) for individual tasks yields strong performance but is expensive for deployment and storage. Recent works explore model merging to combine multiple task-specific models into a single multi-task model without additional training. However, existing merging methods often fail for models fine-tuned with low-rank adaptation (LoRA), due to significant performance degradation. In this paper, we show that this issue arises from a previously overlooked interplay between model parameters and data distributions. We propose Orthogonal Subspaces for Robust model Merging (OSRM) to constrain the LoRA subspace *prior* to fine-tuning, ensuring that updates relevant to one task do not adversely shift outputs for others. Our approach can seamlessly integrate with most existing merging algorithms, reducing the unintended interference among tasks. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

illidanlab/osrm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications