Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion
Linlan Huang, Xusheng Cao, Haori Lu, Xialei Liu

TL;DR
This paper introduces RAPF, a novel method for class-incremental learning with CLIP that adaptively adjusts representations and fuses parameters to reduce forgetting and improve performance on benchmarks.
Contribution
The paper proposes RAPF, a new approach that measures class influence, adjusts representations, and fuses parameters to enhance CLIP's incremental learning capabilities.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Effectively mitigates forgetting of old classes.
Demonstrates superior performance over existing methods.
Abstract
Class-incremental learning is a challenging problem, where the goal is to train a model that can classify data from an increasing number of classes over time. With the advancement of vision-language pre-trained models such as CLIP, they demonstrate good generalization ability that allows them to excel in class-incremental learning with completely frozen parameters. However, further adaptation to downstream tasks by simply fine-tuning the model leads to severe forgetting. Most existing works with pre-trained models assume that the forgetting of old classes is uniform when the model acquires new knowledge. In this paper, we propose a method named Adaptive Representation Adjustment and Parameter Fusion (RAPF). During training for new data, we measure the influence of new classes on old ones and adjust the representations, using textual features. After training, we employ a decomposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare
MethodsContrastive Language-Image Pre-training · Adapter
