Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis

Yanzuo Lu; Yuxi Ren; Xin Xia; Shanchuan Lin; Xing Wang; Xuefeng Xiao; Andy J. Ma; Xiaohua Xie; Jian-Huang Lai

arXiv:2507.18569·cs.CV·July 25, 2025

Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis

Yanzuo Lu, Yuxi Ren, Xin Xia, Shanchuan Lin, Xing Wang, Xuefeng Xiao, Andy J. Ma, Xiaohua Xie, Jian-Huang Lai

PDF

Open Access

TL;DR

This paper introduces Adversarial Distribution Matching (ADM), a novel adversarial framework for diffusion distillation that improves efficiency and quality in image and video synthesis by addressing mode collapse issues in prior methods.

Contribution

The paper proposes ADM, a new adversarial distillation framework using diffusion-based discriminators, enhancing diffusion model compression and synthesis efficiency.

Findings

01

ADM outperforms previous methods in one-step distillation on SDXL.

02

The combined DMDX pipeline achieves better performance with less GPU time.

03

Multi-step ADM distillation sets new benchmarks in image and video synthesis.

Abstract

Distribution Matching Distillation (DMD) is a promising score distillation technique that compresses pre-trained teacher diffusion models into efficient one-step or multi-step student generators. Nevertheless, its reliance on the reverse Kullback-Leibler (KL) divergence minimization potentially induces mode collapse (or mode-seeking) in certain applications. To circumvent this inherent drawback, we propose Adversarial Distribution Matching (ADM), a novel framework that leverages diffusion-based discriminators to align the latent predictions between real and fake score estimators for score distillation in an adversarial manner. In the context of extremely challenging one-step distillation, we further improve the pre-trained generator by adversarial distillation with hybrid discriminators in both latent and pixel spaces. Different from the mean squared error used in DMD2 pre-training, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques