Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin, Micha\"el Gharbi, Taesung Park, Richard Zhang, Eli, Shechtman, Fredo Durand, William T. Freeman

TL;DR
This paper introduces DMD2, an improved distillation method for fast image synthesis that eliminates costly training components, integrates GAN loss, and enables multi-step sampling, achieving state-of-the-art results with significantly reduced inference costs.
Contribution
DMD2 advances distribution matching distillation by removing the regression loss, incorporating GAN training, and addressing training-inference mismatch, leading to superior image generation quality and efficiency.
Findings
Achieved FID of 1.28 on ImageNet-64x64.
Surpassed original teacher model in quality.
Enabled megapixel image generation with few steps.
Abstract
Recent approaches have shown promises distilling diffusion models into efficient one-step generators. Among them, Distribution Matching Distillation (DMD) produces one-step generators that match their teacher in distribution, without enforcing a one-to-one correspondence with the sampling trajectories of their teachers. However, to ensure stable training, DMD requires an additional regression loss computed using a large set of noise-image pairs generated by the teacher with many steps of a deterministic sampler. This is costly for large-scale text-to-image synthesis and limits the student's quality, tying it too closely to the teacher's original sampling paths. We introduce DMD2, a set of techniques that lift this limitation and improve DMD training. First, we eliminate the regression loss and the need for expensive dataset construction. We show that the resulting instability is due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications · Advanced Vision and Imaging
MethodsSparse Evolutionary Training · Diffusion
