Critically-Damped Higher-Order Langevin Dynamics for Generative Modeling
Benjamin Sterling, Chad Gueli, and M\'onica F. Bugallo

TL;DR
This paper introduces a generalized critically-damped higher-order Langevin dynamics framework for diffusion models, providing optimal hyperparameters and closed-form solutions, leading to improved generative modeling performance.
Contribution
It extends critically-damped Langevin dynamics to arbitrary order, offering a new approach with analytical solutions and optimized hyperparameters for diffusion-based generative models.
Findings
Achieved improved FID scores on CIFAR-10 and CelebA-HQ datasets.
Provided closed-form solutions for the forward process mean and covariance.
Demonstrated the effectiveness of critical damping in higher-order Langevin dynamics.
Abstract
Denoising diffusion probabilistic models (DDPMs) represent an entirely new class of generative AI methods that have yet to be fully explored. They use Langevin dynamics, represented as stochastic differential equations, to describe a process that transforms data into noise, the forward process, and a process that transforms noise into generated data, the reverse process. Many of these methods utilize auxiliary variables that formulate the data as a ``position" variable, and the auxiliary variables are referred to as ``velocity", ``acceleration", etc. In this sense, it is possible to ``critically damp" the dynamics. Critical damping has been successfully introduced in Critically-Damped Langevin Dynamics (CLD) and Critically-Damped Third-Order Langevin Dynamics (TOLD++), but has not yet been applied to dynamics of arbitrary order. The proposed methodology generalizes Higher-Order Langevin…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Markov Chains and Monte Carlo Methods
