Maximum Likelihood Training of Score-Based Diffusion Models

Yang Song; Conor Durkan; Iain Murray; Stefano Ermon

arXiv:2101.09258·stat.ML·October 22, 2021·46 cites

Maximum Likelihood Training of Score-Based Diffusion Models

Yang Song, Conor Durkan, Iain Murray, Stefano Ermon

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces a method to train score-based diffusion models using maximum likelihood, leading to improved likelihood scores and competitive performance on standard datasets.

Contribution

It demonstrates that a specific weighting scheme allows approximate maximum likelihood training of score-based diffusion models, bridging a gap in their training methodology.

Findings

01

Maximum likelihood training improves model likelihood across datasets.

02

Achieved state-of-the-art likelihood scores on CIFAR-10 and ImageNet 32x32.

03

Models perform competitively without data augmentation.

Abstract

Score-based diffusion models synthesize samples by reversing a stochastic process that diffuses data to noise, and are trained by minimizing a weighted combination of score matching losses. The log-likelihood of score-based diffusion models can be tractably computed through a connection to continuous normalizing flows, but log-likelihood is not directly optimized by the weighted combination of score matching losses. We show that for a specific weighting scheme, the objective upper bounds the negative log-likelihood, thus enabling approximate maximum likelihood training of score-based diffusion models. We empirically observe that maximum likelihood training consistently improves the likelihood of score-based diffusion models across multiple datasets, stochastic processes, and model architectures. Our best models achieve negative log-likelihoods of 2.83 and 3.76 bits/dim on CIFAR-10 and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Maximum Likelihood Training of Score-Based Diffusion Models· slideslive

Taxonomy

TopicsAdvanced Neuroimaging Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks

MethodsDiffusion