Beyond Anti-Forgetting: Multimodal Continual Instruction Tuning with Positive Forward Transfer
Junhao Zheng, Qianli Ma, Zhen Liu, Binquan Wu, Huawen Feng

TL;DR
This paper introduces Fwd-Prompt, a prompt-based method that enhances multimodal continual instruction tuning by reducing forgetting and negative transfer, enabling models to adapt efficiently to new tasks without retraining from scratch.
Contribution
The paper proposes Fwd-Prompt, a novel prompt tuning approach that minimizes task interference and reuses pre-trained knowledge, addressing catastrophic forgetting and negative transfer in MCIT.
Findings
Fwd-Prompt achieves state-of-the-art results in MCIT tasks.
It updates fewer parameters and requires no old task samples.
The method effectively reduces negative transfer and forgetting.
Abstract
Multimodal Continual Instruction Tuning (MCIT) enables Multimodal Large Language Models (MLLMs) to meet continuously emerging requirements without expensive retraining. MCIT faces two major obstacles: catastrophic forgetting (where old knowledge is forgotten) and negative forward transfer (where the performance of future tasks is degraded). Although existing methods have greatly alleviated catastrophic forgetting, they still suffer from negative forward transfer. We discover a large discrepancy in different input embeddings by performing singular value decomposition (SVD) on input embeddings. This discrepancy results in the model learning irrelevant information for old and pre-trained tasks, leading to catastrophic forgetting and negative forward transfer. To address these issues, we propose Prompt Tuning with Positive Forward Transfer (Fwd-Prompt), a prompt-based method that projects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
