Label-Free Cross-Task LoRA Merging with Null-Space Compression
Wonyoung Lee, Wooseong Jeong, Kuk-Jin Yoon

TL;DR
This paper introduces Null-Space Compression (NSC) Merging, a label-free, output-agnostic method for model merging that effectively combines diverse tasks including classification, regression, and sequence generation.
Contribution
The paper proposes a novel NSC merging technique leveraging adapter geometry, enabling effective multi-task model merging without labels or task-specific tuning.
Findings
Achieves state-of-the-art results on twenty heterogeneous vision tasks.
Outperforms baselines on six NLI benchmarks.
Demonstrates scalability on vision-language tasks like VQA and image captioning.
Abstract
Model merging combines independently fine-tuned checkpoints without joint multi-task training. In the era of foundation-model, fine-tuning with Low-Rank Adaptation (LoRA) is prevalent, making LoRA merging a promising target. Existing approaches can work in homogeneous settings where all target tasks are classification but often fail when tasks span classification and regression. Approaches using entropy-based surrogates do not apply to regression and are costly for large language models due to long token sequences. We introduce Null-Space Compression (NSC) Merging, a label-free, output-agnostic method that sets merge weights from adapter geometry. Our key observation is that during LoRA finetuning the down-projection factor in compresses its null space, and the compression correlates with performance. NSC uses this as an optimization signal for merging that can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
