A Quantitative Characterization of Forgetting in Post-Training

Krishnakumar Balasubramanian; Shiva Prasad Kasiviswanathan

arXiv:2603.12163·cs.LG·March 13, 2026

A Quantitative Characterization of Forgetting in Post-Training

Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan

PDF

Open Access

TL;DR

This paper provides a theoretical framework to understand and quantify forgetting in post-training generative models, analyzing how different objectives and replay strategies influence the retention of old information.

Contribution

It introduces a formalization of forgetting, analyzes the effects of divergence objectives and replay, and evaluates recent post-training methods through this theoretical lens.

Findings

01

Forward-KL drives old weights to zero, causing mass forgetting.

02

Reverse-KL preserves old components and reduces drift.

03

Replay strategies can mitigate forgetting depending on the divergence objective.

Abstract

Continual post-training of generative models is widely used, yet a principled understanding of when and why forgetting occurs remains limited. We develop theoretical results under a two-mode mixture abstraction (representing old and new tasks), proposed by Chen et al. (2025) (arXiv:2510.18874), and formalize forgetting in two forms: (i) mass forgetting, where the old mixture weight collapses to zero, and (ii) old-component drift, where an already-correct old component shifts during training. For equal-covariance Gaussian modes, we prove that forward-KL objectives trained on data from the new distribution drive the old weight to zero, while reverse-KL objectives converge to the true target (thereby avoiding mass forgetting) and perturb the old mean only through overlap-gated misassignment probabilities controlled by the Bhattacharyya coefficient, yielding drift that decays exponentially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Language and cultural evolution