Regularization Shortcomings for Continual Learning

Timoth\'ee Lesort; Andrei Stoian; David Filliat

arXiv:1912.03049·cs.LG·April 6, 2021·20 cites

Regularization Shortcomings for Continual Learning

Timoth\'ee Lesort, Andrei Stoian, David Filliat

PDF

Open Access

TL;DR

This paper critically examines regularization-based continual learning methods, revealing their inability to distinguish classes across tasks in class-incremental scenarios, with implications for reinforcement learning and pre-trained models.

Contribution

It provides a theoretical analysis demonstrating the limitations of regularization approaches in continual learning, supported by experiments and practical implications.

Findings

01

Regularization methods fail to discriminate classes across tasks in class-incremental learning.

02

This shortcoming affects continual reinforcement learning and pre-trained models.

03

Understanding these limitations can guide more effective continual learning strategies.

Abstract

In most machine learning algorithms, training data is assumed to be independent and identically distributed (iid). When it is not the case, the algorithm's performances are challenged, leading to the famous phenomenon of catastrophic forgetting. Algorithms dealing with it are gathered in the Continual Learning research field. In this paper, we study the regularization based approaches to continual learning and show that those approaches can not learn to discriminate classes from different tasks in an elemental continual benchmark: the class-incremental scenario. We make theoretical reasoning to prove this shortcoming and illustrate it with examples and experiments. Moreover, we show that it can have some important consequences on continual multi-tasks reinforcement learning or in pre-trained models used for continual learning. We believe that highlighting and understanding the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications