The critical slowing down in diffusion models
Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau, Marylou Gabri\'e

TL;DR
This paper analyzes the critical slowing down phenomenon in diffusion models within a statistical physics framework, showing how architectural choices can significantly reduce training and sampling bottlenecks.
Contribution
It provides a theoretical analysis of critical slowing down in diffusion models and demonstrates how deeper architectures mitigate this issue.
Findings
Training time scales logarithmically with system size using two-layer networks.
Critical slowing down affects both training and sampling in diffusion models.
Local score approximations can accelerate training without increasing model complexity.
Abstract
Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theoretical control over when and why they succeed. Here we provide such insight for diffusion models-a class of generative schemes highly effective in practice-by analyzing their application to the model of statistical field theory in the Gaussian limit . In this analytically tractable setting, we show that training a score model with a one-layer network architecture matching the exact solution exhibits a form of critical slowing down in parameter learning. This slowing down also impacts the generation process, indicating that the well-known difficulties of sampling near criticality persist even for learned generative models. To overcome this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
