DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning
Xiwei Liu, Yulong Li, Feilong Tang, Imran Razzak

TL;DR
DeLo introduces a dual-decomposed low-rank expert architecture for continual missing modality learning, effectively addressing modality interference and catastrophic forgetting in large multimodal models through novel routing and memory mechanisms.
Contribution
This work presents the first dual-decomposed low-rank expert framework for CMML, improving upon prompt tuning and naive LoRA methods by reducing modality interference and enhancing continual learning.
Findings
Outperforms state-of-the-art CMML methods on benchmark datasets
Effectively handles modality incompleteness with Cross-Modal Guided Routing
Reduces catastrophic forgetting with task-partitioned architecture
Abstract
Adapting Large Multimodal Models (LMMs) to real-world scenarios poses the dual challenges of learning from sequential data streams while handling frequent modality incompleteness, a task known as Continual Missing Modality Learning (CMML). However, existing works on CMML have predominantly relied on prompt tuning, a technique that struggles with this task due to cross-task interference between its learnable prompts in their shared embedding space. A naive application of Low-Rank Adaptation (LoRA) with modality-shared module will also suffer modality interference from competing gradients. To this end, we propose DeLo, the first framework to leverage a novel dual-decomposed low-rank expert architecture for CMML. Specifically, this architecture resolves modality interference through decomposed LoRA expert, dynamically composing LoRA update matrix with rank-one factors from disentangled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning in Healthcare · Topic Modeling
