CASP: Few-Shot Class-Incremental Learning with CLS Token Attention Steering Prompts

Shuai Huang; Xuhan Lin; Yuwu Lu

arXiv:2601.16773·cs.CV·January 26, 2026

CASP: Few-Shot Class-Incremental Learning with CLS Token Attention Steering Prompts

Shuai Huang, Xuhan Lin, Yuwu Lu

PDF

Open Access

TL;DR

This paper introduces CASP, a novel prompt-based method leveraging CLS token attention steering, to improve few-shot class-incremental learning by enhancing transferability, generalization, and reducing parameter overhead.

Contribution

The paper proposes CLS Token Attention Steering Prompts (CASP), a new approach that modulates self-attention with trainable biases and uses data augmentation strategies for better FSCIL performance.

Findings

01

CASP outperforms state-of-the-art methods on multiple datasets.

02

CASP does not require fine-tuning during incremental phases.

03

CASP significantly reduces parameter overhead.

Abstract

Few-shot class-incremental learning (FSCIL) presents a core challenge in continual learning, requiring models to rapidly adapt to new classes with very limited samples while mitigating catastrophic forgetting. Recent prompt-based methods, which integrate pretrained backbones with task-specific prompts, have made notable progress. However, under extreme few-shot incremental settings, the model's ability to transfer and generalize becomes critical, and it is thus essential to leverage pretrained knowledge to learn feature representations that can be shared across future categories during the base session. Inspired by the mechanism of the CLS token, which is similar to human attention and progressively filters out task-irrelevant information, we propose the CLS Token Attention Steering Prompts (CASP). This approach introduces class-shared trainable bias parameters into the query, key, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis