CP-Prompt: Composition-Based Cross-modal Prompting for   Domain-Incremental Continual Learning

Yu Feng; Zhen Tian; Yifan Zhu; Zongfu Han; Haoran Luo; Guangwei Zhang,; Meina Song

arXiv:2407.21043·cs.CL·August 5, 2024·1 cites

CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning

Yu Feng, Zhen Tian, Yifan Zhu, Zongfu Han, Haoran Luo, Guangwei Zhang,, Meina Song

PDF

Open Access 1 Repo

TL;DR

CP-Prompt introduces a novel composition-based prompting framework that enables continual learning across different domains in cross-modal settings, effectively reducing forgetting and improving performance.

Contribution

It proposes a simple, parameter-efficient method that captures intra-domain and inter-domain knowledge through compositional prompts for domain-incremental continual learning.

Findings

01

Outperforms state-of-the-art methods on three DIL tasks

02

Effectively reduces forgetting in cross-modal continual learning

03

Demonstrates superior domain adaptation capabilities

Abstract

The key challenge of cross-modal domain-incremental learning (DIL) is to enable the learning model to continuously learn from novel data with different feature distributions under the same task without forgetting old ones. However, existing top-performing methods still cause high forgetting rates, by lacking intra-domain knowledge extraction and inter-domain common prompting strategy. In this paper, we propose a simple yet effective framework, CP-Prompt, by training limited parameters to instruct a pre-trained model to learn new domains and avoid forgetting existing feature distributions. CP-Prompt captures intra-domain knowledge by compositionally inserting personalized prompts on multi-head self-attention layers and then learns the inter-domain knowledge with a common prompting strategy. CP-Prompt shows superiority compared with state-of-the-art baselines among three widely evaluated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dannis97500/cp_prompt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Speech and dialogue systems · Domain Adaptation and Few-Shot Learning