Compensating Distribution Drifts in Class-incremental Learning of Pre-trained Vision Transformers
Xuan Rao, Simian Xu, Zheng Li, Bo Zhao, Derong Liu, Mingming Ha, Cesare Alippi

TL;DR
This paper introduces SLDC, a method to mitigate distribution drift in class-incremental learning with pre-trained vision transformers, significantly improving sequential fine-tuning performance by aligning feature distributions across tasks.
Contribution
The paper proposes a novel SLDC approach with linear and weakly nonlinear variants to address distribution drift in CIL of ViTs, enhancing sequential learning effectiveness.
Findings
SLDC improves classifier performance in CIL benchmarks.
Combining KD with SLDC achieves results comparable to joint training.
SLDC effectively aligns feature distributions across tasks.
Abstract
Recent advances have shown that sequential fine-tuning (SeqFT) of pre-trained vision transformers (ViTs), followed by classifier refinement using approximate distributions of class features, can be an effective strategy for class-incremental learning (CIL). However, this approach is susceptible to distribution drift, caused by the sequential optimization of shared backbone parameters. This results in a mismatch between the distributions of the previously learned classes and that of the updater model, ultimately degrading the effectiveness of classifier performance over time. To address this issue, we introduce a latent space transition operator and propose Sequential Learning with Drift Compensation (SLDC). SLDC aims to align feature distributions across tasks to mitigate the impact of drift. First, we present a linear variant of SLDC, which learns a linear operator by solving a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Data Stream Mining Techniques · Advanced Neural Network Applications
