A Quantitative Characterization of Forgetting in Post-Training
Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan

TL;DR
This paper provides a theoretical framework to understand and quantify forgetting in post-training generative models, analyzing how different objectives and replay strategies influence the retention of old information.
Contribution
It introduces a formalization of forgetting, analyzes the effects of divergence objectives and replay, and evaluates recent post-training methods through this theoretical lens.
Findings
Forward-KL drives old weights to zero, causing mass forgetting.
Reverse-KL preserves old components and reduces drift.
Replay strategies can mitigate forgetting depending on the divergence objective.
Abstract
Continual post-training of generative models is widely used, yet a principled understanding of when and why forgetting occurs remains limited. We develop theoretical results under a two-mode mixture abstraction (representing old and new tasks), proposed by Chen et al. (2025) (arXiv:2510.18874), and formalize forgetting in two forms: (i) mass forgetting, where the old mixture weight collapses to zero, and (ii) old-component drift, where an already-correct old component shifts during training. For equal-covariance Gaussian modes, we prove that forward-KL objectives trained on data from the new distribution drive the old weight to zero, while reverse-KL objectives converge to the true target (thereby avoiding mass forgetting) and perturb the old mean only through overlap-gated misassignment probabilities controlled by the Bhattacharyya coefficient, yielding drift that decays exponentially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Language and cultural evolution
