On the Edge of Memorization in Diffusion Models
Sam Buchanan, Druv Pai, Yi Ma, Valentin De Bortoli

TL;DR
This paper investigates when diffusion models memorize training data versus generalize beyond it, providing a theoretical framework and experimental validation for understanding the phase transition between these behaviors based on model size.
Contribution
It introduces a mathematical laboratory for analyzing memorization and generalization in diffusion models, and predicts the critical model size where memorization dominates.
Findings
Theoretical characterization of the crossover point for memorization versus generalization.
Experimental validation of the phase transition in diffusion models.
Analytical prediction of the model size threshold for memorization.
Abstract
When do diffusion models reproduce their training data, and when are they able to generate samples beyond it? A practically relevant theoretical understanding of this interplay between memorization and generalization may significantly impact real-world deployments of diffusion models with respect to issues such as copyright infringement and data privacy. In this work, to disentangle the different factors that influence memorization and generalization in practical diffusion models, we introduce a scientific and mathematical "laboratory" for investigating these phenomena in diffusion models trained on fully synthetic or natural image-like structured data. Within this setting, we hypothesize that the memorization or generalization behavior of an underparameterized trained model is determined by the difference in training loss between an associated memorizing model and a generalizing model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
