TL;DR
This paper introduces a novel $f$-divergence framework for distilling diffusion models into single-step generators, improving mode coverage and generation speed over traditional KL-based methods.
Contribution
It generalizes distribution matching in diffusion model distillation using $f$-divergences, deriving gradient expressions and demonstrating superior empirical performance.
Findings
Alternative $f$-divergences outperform reverse-KL in image generation.
Jensen-Shannon divergence achieves state-of-the-art one-step generation results.
The framework unifies and extends existing variational score distillation methods.
Abstract
Sampling from diffusion models involves a slow iterative process that hinders their practical deployment, especially for interactive applications. To accelerate generation speed, recent approaches distill a multi-step diffusion model into a single-step student generator via variational score distillation, which matches the distribution of samples generated by the student to the teacher's distribution. However, these approaches use the reverse Kullback-Leibler (KL) divergence for distribution matching which is known to be mode seeking. In this paper, we generalize the distribution matching approach using a novel -divergence minimization framework, termed -distill, that covers different divergences with different trade-offs in terms of mode coverage and training variance. We derive the gradient of the -divergence between the teacher and student distributions and show that it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
