AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs

Xuyang Wei; Chunlin Tian; Li Li

arXiv:2502.20035·cs.CV·February 28, 2025

AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs

Xuyang Wei, Chunlin Tian, Li Li

PDF

Open Access 1 Repo

TL;DR

AsymLoRA is a novel parameter-efficient fine-tuning method for MLLMs that effectively manages data conflicts and commonalities, improving performance across diverse multimodal tasks.

Contribution

It introduces asymmetric LoRA with task-specific and shared projections to unify handling of conflicting and common dataset features in MLLMs.

Findings

01

Outperforms vanilla LoRA and LoRA-MoE in accuracy

02

Enhances model efficiency and adaptability

03

Achieves superior results on multiple benchmarks

Abstract

Effective instruction fine-tuning on diverse image-text datasets is crucial for developing a versatile Multimodal Large Language Model (MLLM), where dataset composition dictates the model's adaptability across multimodal tasks. However, complex datasets often contain inherent conflicts -- stemming from modality-specific optimization objectives -- and latent commonalities that enable cross-task transfer, which most existing approaches handle separately. To bridge this gap, we introduce AsymLoRA, a parameter-efficient tuning framework that unifies knowledge modularization and cross-modal coordination via asymmetric LoRA: task-specific low-rank projections (matrix B) that preserve distinct adaptation pathways for conflicting objectives, and a shared projection (matrix A) that consolidates cross-modal commonalities. Extensive evaluations demonstrate that AsymLoRA consistently surpasses both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clin0212/hydralora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning