Unveiling the Tapestry: the Interplay of Generalization and Forgetting   in Continual Learning

Zenglin Shi; Jing Jie; Ying Sun; Joo Hwee Lim; Mengmi Zhang

arXiv:2211.11174·cs.CV·March 25, 2025·1 cites

Unveiling the Tapestry: the Interplay of Generalization and Forgetting in Continual Learning

Zenglin Shi, Jing Jie, Ying Sun, Joo Hwee Lim, Mengmi Zhang

PDF

Open Access

TL;DR

This paper explores the relationship between generalization and forgetting in continual learning, providing empirical evidence of their positive interaction and introducing a novel regularization technique, STCR, that improves both generalization and knowledge retention.

Contribution

It presents the first empirical study demonstrating the mutual benefits of generalization and forgetting in continual learning and proposes STCR, a simple method that enhances both aspects simultaneously.

Findings

01

STCR improves continual learning performance significantly.

02

The mutual positive effect between generalization and forgetting is empirically validated.

03

STCR can be integrated with existing methods for better results.

Abstract

In AI, generalization refers to a model's ability to perform well on out-of-distribution data related to the given task, beyond the data it was trained on. For an AI agent to excel, it must also possess the continual learning capability, whereby an agent incrementally learns to perform a sequence of tasks without forgetting the previously acquired knowledge to solve the old tasks. Intuitively, generalization within a task allows the model to learn underlying features that can readily be applied to novel tasks, facilitating quicker learning and enhanced performance in subsequent tasks within a continual learning framework. Conversely, continual learning methods often include mechanisms to mitigate catastrophic forgetting, ensuring that knowledge from earlier tasks is retained. This preservation of knowledge over tasks plays a role in enhancing generalization for the ongoing task at hand.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications