Mitigating Catastrophic Forgetting in Streaming Generative and Predictive Learning via Stateful Replay
Wenzhang Du

TL;DR
This paper investigates how stateful replay can mitigate catastrophic forgetting in streaming learning tasks across generative and predictive models, providing a unified analysis and empirical validation on multiple datasets.
Contribution
It introduces a unified gradient alignment analysis for replay and fine-tuning, demonstrating the effectiveness of stateful replay across diverse streaming scenarios.
Findings
Replay reduces forgetting by 2-3 times on multi-task streams.
Stateful replay performs comparably to fine-tuning on time-based streams.
The study provides a unified framework for understanding replay in streaming learning.
Abstract
Many deployed learning systems must update models on streaming data under memory constraints. The default strategy, sequential fine-tuning on each new phase, is architecture-agnostic but often suffers catastrophic forgetting when later phases correspond to different sub-populations or tasks. Replay with a finite buffer is a simple alternative, yet its behaviour across generative and predictive objectives is not well understood. We present a unified study of stateful replay for streaming autoencoding, time series forecasting, and classification. We view both sequential fine-tuning and replay as stochastic gradient methods for an ideal joint objective, and use a gradient alignment analysis to show when mixing current and historical samples should reduce forgetting. We then evaluate a single replay mechanism on six streaming scenarios built from Rotated MNIST, ElectricityLoadDiagrams…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Domain Adaptation and Few-Shot Learning · Time Series Analysis and Forecasting
