From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning
Zonghuan Xu, Xingjun Ma

TL;DR
This paper provides a spectral analysis of forgetting in continual learning by studying how the task distribution influences performance loss, offering a theoretical framework for understanding forgetting dynamics.
Contribution
It introduces a spectral characterization of forgetting based on task distribution, extending prior order-based analyses to a distributional perspective in linear models.
Findings
Derived an exact operator identity for forgetting.
Established an upper bound and asymptotic convergence rate.
Linked forgetting rate to geometric properties of the task distribution.
Abstract
A central challenge in continual learning is forgetting, the loss of performance on previously learned tasks induced by sequential adaptation to new ones. While forgetting has been extensively studied empirically, rigorous theoretical characterizations remain limited. A notable step in this direction is \citet{evron2022catastrophic}, which analyzes forgetting under random orderings of a fixed task collection in overparameterized linear regression. We shift the perspective from order to distribution. Rather than asking how a fixed task collection behaves under random orderings, we study an exact-fit linear regime in which tasks are sampled i.i.d.\ from a task distribution~, and ask how the generating distribution itself governs forgetting. In this setting, we derive an exact operator identity for the forgetting quantity, revealing a recursive spectral structure. Building on this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
