Beyond Anti-Forgetting: Multimodal Continual Instruction Tuning with   Positive Forward Transfer

Junhao Zheng; Qianli Ma; Zhen Liu; Binquan Wu; Huawen Feng

arXiv:2401.09181·cs.LG·June 28, 2024·2 cites

Beyond Anti-Forgetting: Multimodal Continual Instruction Tuning with Positive Forward Transfer

Junhao Zheng, Qianli Ma, Zhen Liu, Binquan Wu, Huawen Feng

PDF

Open Access

TL;DR

This paper introduces Fwd-Prompt, a prompt-based method that enhances multimodal continual instruction tuning by reducing forgetting and negative transfer, enabling models to adapt efficiently to new tasks without retraining from scratch.

Contribution

The paper proposes Fwd-Prompt, a novel prompt tuning approach that minimizes task interference and reuses pre-trained knowledge, addressing catastrophic forgetting and negative transfer in MCIT.

Findings

01

Fwd-Prompt achieves state-of-the-art results in MCIT tasks.

02

It updates fewer parameters and requires no old task samples.

03

The method effectively reduces negative transfer and forgetting.

Abstract

Multimodal Continual Instruction Tuning (MCIT) enables Multimodal Large Language Models (MLLMs) to meet continuously emerging requirements without expensive retraining. MCIT faces two major obstacles: catastrophic forgetting (where old knowledge is forgotten) and negative forward transfer (where the performance of future tasks is degraded). Although existing methods have greatly alleviated catastrophic forgetting, they still suffer from negative forward transfer. We discover a large discrepancy in different input embeddings by performing singular value decomposition (SVD) on input embeddings. This discrepancy results in the model learning irrelevant information for old and pre-trained tasks, leading to catastrophic forgetting and negative forward transfer. To address these issues, we propose Prompt Tuning with Positive Forward Transfer (Fwd-Prompt), a prompt-based method that projects…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning