Continual Learning with Pre-Trained Models: A Survey

Da-Wei Zhou; Hai-Long Sun; Jingyi Ning; Han-Jia Ye; De-Chuan Zhan

arXiv:2401.16386·cs.LG·April 24, 2024·1 cites

Continual Learning with Pre-Trained Models: A Survey

Da-Wei Zhou, Hai-Long Sun, Jingyi Ning, Han-Jia Ye, De-Chuan Zhan

PDF

Open Access 2 Repos

TL;DR

This survey reviews recent advancements in continual learning that leverage pre-trained models, categorizing methodologies, comparing their strengths and weaknesses, and providing empirical evaluations to address fairness concerns.

Contribution

It offers a comprehensive categorization of PTM-based continual learning methods and includes an empirical study comparing state-of-the-art approaches.

Findings

01

Categorized PTM-based CL methods into three groups

02

Provided a comparative analysis of methodologies

03

Highlighted fairness issues in current evaluations

Abstract

Nowadays, real-world applications often face streaming data, which requires the learning system to absorb new knowledge as data evolves. Continual Learning (CL) aims to achieve this goal and meanwhile overcome the catastrophic forgetting of former knowledge when learning new ones. Typical CL methods build the model from scratch to grow with incoming data. However, the advent of the pre-trained model (PTM) era has sparked immense research interest, particularly in leveraging PTMs' robust representational capabilities. This paper presents a comprehensive survey of the latest advancements in PTM-based CL. We categorize existing methodologies into three distinct groups, providing a comparative analysis of their similarities, differences, and respective advantages and disadvantages. Additionally, we offer an empirical study contrasting various state-of-the-art methods to highlight concerns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis