Provable Effects of Data Replay in Continual Learning: A Feature Learning Perspective
Meng Ding, Jinhui Xu, Kaiyi Ji

TL;DR
This paper provides a theoretical analysis of full data replay in continual learning, revealing how signal-to-noise ratio influences forgetting and task recovery, and highlights the importance of task ordering.
Contribution
It introduces a comprehensive theoretical framework for data replay in continual learning, emphasizing the role of SNR and task order in mitigating forgetting.
Findings
Forgetting occurs when noise dominates signal in later tasks.
Sufficient signal accumulation enables recovery of earlier tasks.
Prioritizing high-signal tasks helps prevent catastrophic forgetting.
Abstract
Continual learning (CL) aims to train models on a sequence of tasks while retaining performance on previously learned ones. A core challenge in this setting is catastrophic forgetting, where new learning interferes with past knowledge. Among various mitigation strategies, data-replay methods, where past samples are periodically revisited, are considered simple yet effective, especially when memory constraints are relaxed. However, the theoretical effectiveness of full data replay, where all past data is accessible during training, remains largely unexplored. In this paper, we present a comprehensive theoretical framework for analyzing full data-replay training in continual learning from a feature learning perspective. Adopting a multi-view data model, we identify the signal-to-noise ratio (SNR) as a critical factor affecting forgetting. Focusing on task-incremental binary classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis
