Visual Prompt Tuning in Null Space for Continual Learning
Yue Lu, Shizhou Zhang, De Cheng, Yinghui Xing, Nannan Wang, Peng Wang,, Yanning Zhang

TL;DR
This paper introduces a novel method for continual learning in vision transformers by tuning prompts in the null space to prevent interference with previous tasks, backed by theoretical guarantees and extensive experiments.
Contribution
It proposes a null-space-based prompt tuning approach with theoretical analysis and practical approximation for continual learning in vision transformers.
Findings
Achieves superior performance on four class-incremental benchmarks.
Effectively prevents catastrophic forgetting in vision transformers.
Outperforms state-of-the-art methods in continual learning scenarios.
Abstract
Existing prompt-tuning methods have demonstrated impressive performances in continual learning (CL), by selecting and updating relevant prompts in the vision-transformer models. On the contrary, this paper aims to learn each task by tuning the prompts in the direction orthogonal to the subspace spanned by previous tasks' features, so as to ensure no interference on tasks that have been learned to overcome catastrophic forgetting in CL. However, different from the orthogonal projection in the traditional CNN architecture, the prompt gradient orthogonal projection in the ViT architecture shows completely different and greater challenges, i.e., 1) the high-order and non-linear self-attention operation; 2) the drift of prompt distribution brought by the LayerNorm in the transformer block. Theoretically, we have finally deduced two consistency conditions to achieve the prompt gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImage Processing Techniques and Applications · Experimental Learning in Engineering · Neural Networks and Applications
