RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness
Fanhu Zeng, Haiyang Guo, Fei Zhu, Li Shen, Hao Tang

TL;DR
RobustMerge is a training-free, parameter-efficient method for merging multi-task large language models, focusing on maintaining direction robustness through singular value compensation and cross-task normalization, leading to improved generalization.
Contribution
The paper introduces RobustMerge, a novel merging technique that preserves direction robustness without additional training, addressing limitations of existing full fine-tuning merging methods.
Findings
RobustMerge outperforms existing merging methods on diverse multimodal tasks.
It maintains direction robustness by compensating for singular value gaps.
The method enhances unseen task generalization and model stability.
Abstract
Fine-tuning pre-trained models with custom data leads to numerous expert models on specific tasks. Merging models into one universal model to empower multi-task ability refraining from data leakage has gained popularity. With the expansion in data and model size, parameter-efficient tuning becomes the common practice for obtaining task-specific models efficiently. However, few methods are dedicated to efficient merging, and existing methods designed for full fine-tuning merging fail under efficient merging. To address the issue, we analyze from low-rank decomposition and reveal that direction robustness during merging is crucial for merging efficient modules. We furthermore uncover that compensating for the gap between stark singular values contributes to direction robustness. Therefore, we propose RobustMerge, a training-free parameter-efficient merging method with complementary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_ImageNetmodel· 9 dl9 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_IconQAmodel· 6 dl6 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_RECmodel· 5 dl5 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_flickr30kmodel· 7 dl7 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_VQAv2model· 5 dl5 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_VizWizmodel· 6 dl6 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_OCRVQAmodel· 6 dl6 dl
- 🤗AuroraZengfh/LLaVA_7B_lora_r16_ScienceQAmodel· 6 dl· ♡ 16 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
