Variational Diffusion Models
Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho

TL;DR
This paper introduces a family of diffusion-based generative models that achieve state-of-the-art likelihoods on image benchmarks, with improved theoretical understanding and faster optimization methods.
Contribution
The authors develop a variational diffusion model with an efficient noise schedule optimization, theoretical insights into the variational lower bound, and state-of-the-art likelihood performance.
Findings
Achieved state-of-the-art likelihoods on image density benchmarks.
Proved equivalence between several diffusion models in literature.
Enabled faster training through optimized noise schedules.
Abstract
Diffusion-based generative models have demonstrated a capacity for perceptually impressive synthesis, but can they also be great likelihood-based models? We answer this in the affirmative, and introduce a family of diffusion-based generative models that obtain state-of-the-art likelihoods on standard image density estimation benchmarks. Unlike other diffusion-based models, our method allows for efficient optimization of the noise schedule jointly with the rest of the model. We show that the variational lower bound (VLB) simplifies to a remarkably short expression in terms of the signal-to-noise ratio of the diffused data, thereby improving our theoretical understanding of this model class. Using this insight, we prove an equivalence between several models proposed in the literature. In addition, we show that the continuous-time VLB is invariant to the noise schedule, except for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Music and Audio Processing
