SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training

Gengwei Zhang; Liyuan Wang; Guoliang Kang; Ling Chen; Yunchao Wei

arXiv:2408.08295·cs.CV·August 16, 2024

TL;DR

This paper introduces SLCA++, a novel framework that enhances sequential fine-tuning for continual learning with pre-trained models, effectively mitigating overfitting and improving performance across image classification tasks.

Contribution

SLCA++ is the first to systematically analyze and address overfitting in sequential fine-tuning for continual learning, combining a slow learner and classifier alignment for superior results.
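
As a rough illustration of the "slow learner" idea named above, the sketch below applies a much smaller learning rate to the representation (backbone) parameters than to the classifier during a plain SGD step. This is a minimal sketch under stated assumptions, not the authors' implementation: the function `sgd_step`, the group names, and the learning-rate values are all hypothetical and are not taken from this page.

```python
# Hypothetical sketch of a "slow learner": the backbone is updated with a
# much smaller learning rate than the classifier head.

def sgd_step(params, grads, lrs):
    """Return updated parameters; each group uses its own learning rate."""
    return {
        group: [w - lrs[group] * g for w, g in zip(params[group], grads[group])]
        for group in params
    }

# Illustrative values only (assumed, not reported on this page).
LEARNING_RATES = {"backbone": 1e-4, "classifier": 1e-2}

# Toy parameters and gradients standing in for a real model.
params = {"backbone": [1.0, -2.0], "classifier": [0.5]}
grads = {"backbone": [10.0, 10.0], "classifier": [10.0]}

updated = sgd_step(params, grads, LEARNING_RATES)
```

With identical gradients, the backbone weights here move 100 times less than the classifier weight, which is the sense in which the pre-trained representation is updated "slowly" while the classifier adapts to each new task.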

Findings

01 Outperforms state-of-the-art methods on multiple benchmarks.

02 Effectively mitigates overfitting in continual learning.

03 Provides a strong, practical baseline for future CLPT research.

Abstract

In recent years, continual learning with pre-training (CLPT) has received widespread interest, in place of the traditional focus on training from scratch. Strong pre-trained models (PTMs) can greatly facilitate knowledge transfer and alleviate catastrophic forgetting, but they also suffer from progressive overfitting of pre-trained knowledge to specific downstream tasks. Most current efforts keep the PTMs frozen and incorporate task-specific prompts to instruct representation learning, coupled with a prompt selection process at inference. However, due to the limited capacity of prompt parameters, this strategy achieves only sub-optimal performance in continual learning. In comparison, tuning all parameters of PTMs often provides the greatest potential for representation learning, making sequential fine-tuning (Seq FT) a fundamental baseline that has been…

Code & Models

Repositories

gengdavid/slca
PyTorch · Official

Taxonomy

Topics: COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning

Methods: Focus · ALIGN