Visual Prompt Tuning in Null Space for Continual Learning

Yue Lu; Shizhou Zhang; De Cheng; Yinghui Xing; Nannan Wang; Peng Wang,; Yanning Zhang

arXiv:2406.05658·cs.CV·October 29, 2024

Visual Prompt Tuning in Null Space for Continual Learning

Yue Lu, Shizhou Zhang, De Cheng, Yinghui Xing, Nannan Wang, Peng Wang,, Yanning Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel method for continual learning in vision transformers by tuning prompts in the null space to prevent interference with previous tasks, backed by theoretical guarantees and extensive experiments.

Contribution

It proposes a null-space-based prompt tuning approach with theoretical analysis and practical approximation for continual learning in vision transformers.

Findings

01

Achieves superior performance on four class-incremental benchmarks.

02

Effectively prevents catastrophic forgetting in vision transformers.

03

Outperforms state-of-the-art methods in continual learning scenarios.

Abstract

Existing prompt-tuning methods have demonstrated impressive performances in continual learning (CL), by selecting and updating relevant prompts in the vision-transformer models. On the contrary, this paper aims to learn each task by tuning the prompts in the direction orthogonal to the subspace spanned by previous tasks' features, so as to ensure no interference on tasks that have been learned to overcome catastrophic forgetting in CL. However, different from the orthogonal projection in the traditional CNN architecture, the prompt gradient orthogonal projection in the ViT architecture shows completely different and greater challenges, i.e., 1) the high-order and non-linear self-attention operation; 2) the drift of prompt distribution brought by the LayerNorm in the transformer block. Theoretically, we have finally deduced two consistency conditions to achieve the prompt gradient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zugexiaodui/vptinnsforcl
pytorchOfficial

Videos

Visual Prompt Tuning in Null Space for Continual Learning· slideslive

Taxonomy

TopicsImage Processing Techniques and Applications · Experimental Learning in Engineering · Neural Networks and Applications