Memorization and Regularization in Generative Diffusion Models

Ricardo Baptista; Agnimitra Dasgupta; Nikola B. Kovachki; Assad; Oberai; and Andrew M. Stuart

arXiv:2501.15785·cs.LG·March 19, 2025

Memorization and Regularization in Generative Diffusion Models

Ricardo Baptista, Agnimitra Dasgupta, Nikola B. Kovachki, Assad, Oberai, and Andrew M. Stuart

PDF

Open Access 1 Repo

TL;DR

This paper analyzes how diffusion models memorize training data and explores regularization techniques to prevent this, providing a theoretical foundation and empirical evaluation of methods like Tikhonov regularization and early stopping.

Contribution

It offers a theoretical analysis of memorization in diffusion models and investigates regularization strategies to mitigate it, advancing understanding of model generalization.

Findings

01

Regularization can prevent memorization in diffusion models.

02

Tikhonov regularization promotes better generalization.

03

Early stopping and under-parameterization reduce data memorization.

Abstract

Diffusion models have emerged as a powerful framework for generative modeling. At the heart of the methodology is score matching: learning gradients of families of log-densities for noisy versions of the data distribution at different scales. When the loss function adopted in score matching is evaluated using empirical data, rather than the population loss, the minimizer corresponds to the score of a time-dependent Gaussian mixture. However, use of this analytically tractable minimizer leads to data memorization: in both unconditioned and conditioned settings, the generative model returns the training samples. This paper contains an analysis of the dynamical mechanism underlying memorization. The analysis highlights the need for regularization to avoid reproducing the analytically tractable minimizer; and, in so doing, lays the foundations for a principled understanding of how to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

baptistar/DiffusionModelDynamics
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Mathematical Modeling in Engineering

MethodsEarly Stopping