Unveiling the Tapestry: the Interplay of Generalization and Forgetting in Continual Learning
Zenglin Shi, Jing Jie, Ying Sun, Joo Hwee Lim, Mengmi Zhang

TL;DR
This paper explores the relationship between generalization and forgetting in continual learning, providing empirical evidence of their positive interaction and introducing a novel regularization technique, STCR, that improves both generalization and knowledge retention.
Contribution
It presents the first empirical study demonstrating the mutual benefits of generalization and forgetting in continual learning and proposes STCR, a simple method that enhances both aspects simultaneously.
Findings
STCR improves continual learning performance significantly.
The mutual positive effect between generalization and forgetting is empirically validated.
STCR can be integrated with existing methods for better results.
Abstract
In AI, generalization refers to a model's ability to perform well on out-of-distribution data related to the given task, beyond the data it was trained on. For an AI agent to excel, it must also possess the continual learning capability, whereby an agent incrementally learns to perform a sequence of tasks without forgetting the previously acquired knowledge to solve the old tasks. Intuitively, generalization within a task allows the model to learn underlying features that can readily be applied to novel tasks, facilitating quicker learning and enhanced performance in subsequent tasks within a continual learning framework. Conversely, continual learning methods often include mechanisms to mitigate catastrophic forgetting, ensuring that knowledge from earlier tasks is retained. This preservation of knowledge over tasks plays a role in enhancing generalization for the ongoing task at hand.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
