AdvDMD: Adversarial Reward Meets DMD For High-Quality Few-Step Generation

Xu Wang; Zexian Li; Litong Gong; Tiezheng Ge; Zhijie Deng

arXiv:2604.28126·cs.CV·May 1, 2026

AdvDMD: Adversarial Reward Meets DMD For High-Quality Few-Step Generation

Xu Wang, Zexian Li, Litong Gong, Tiezheng Ge, Zhijie Deng

PDF

TL;DR

AdvDMD introduces a unified approach combining Distribution Matching Distillation and reinforcement learning, utilizing an adversarial discriminator for high-quality few-step image generation in diffusion models.

Contribution

It proposes a seamless integration of DMD and RL with an adversarial discriminator, improving efficiency and performance in few-step diffusion model sampling.

Findings

01

4-step AdvDMD outperforms 40-step SD3.5 on DPG-Bench.

02

Significant performance gains for SD3 on GenEval.

03

2-step AdvDMD surpasses TwinFlow on Qwen-Image.

Abstract

Diffusion models offer superior generation quality at the expense of extensive sampling steps. Distillation methods, with Distribution Matching Distillation (DMD) as a popular example, can mitigate this issue, but performance degradation remains pronounced when sampling steps are limited. Reinforcement learning (RL) has been leveraged to improve the few-step generation quality during distillation, with the potential to even surpass the performance of the teacher model. However, existing approaches are combinatorial in nature, merely integrating an RL process with the distillation process, which introduces unnecessary complexities. To address this gap, we propose AdvDMD, a method that seamlessly unifies DMD distillation and RL. Specifically, AdvDMD employs the adversarially trained discriminator from DMD2 as the reward model, which assigns low scores to generated images and high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.