EM Distillation for One-step Diffusion Models

Sirui Xie; Zhisheng Xiao; Diederik P Kingma; Tingbo Hou; Ying Nian Wu,; Kevin Patrick Murphy; Tim Salimans; Ben Poole; Ruiqi Gao

arXiv:2405.16852·cs.LG·December 9, 2024

EM Distillation for One-step Diffusion Models

Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu,, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao

PDF

Open Access 1 Video

TL;DR

EM Distillation (EMD) is a novel maximum likelihood approach that efficiently converts diffusion models into one-step generators, maintaining high perceptual quality and outperforming existing methods on standard benchmarks.

Contribution

The paper introduces EM Distillation, a new EM-based method for distilling diffusion models into one-step generators with minimal quality loss and improved stability.

Findings

01

EMD achieves better FID scores on ImageNet-64 and ImageNet-128.

02

It outperforms existing one-step generative methods.

03

It effectively distills text-to-image diffusion models.

Abstract

While diffusion models can learn complex distributions, sampling requires a computationally expensive iterative process. Existing distillation methods enable efficient sampling, but have notable limitations, such as performance degradation with very few sampling steps, reliance on training data access, or mode-seeking optimization that may fail to capture the full distribution. We propose EM Distillation (EMD), a maximum likelihood-based approach that distills a diffusion model to a one-step generator model with minimal loss of perceptual quality. Our approach is derived through the lens of Expectation-Maximization (EM), where the generator parameters are updated using samples from the joint distribution of the diffusion teacher prior and inferred generator latents. We develop a reparametrized sampling scheme and a noise cancellation technique that together stabilizes the distillation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

EM Distillation for One-step Diffusion Models· slideslive

Taxonomy

TopicsExtraction and Separation Processes

MethodsDiffusion