Efficient Diffusion Distillation via Embedding Loss
Jincheng Ying, Yitao Chen, Li Wenlin, Minghui Xu, Yinhao Xiao

TL;DR
This paper introduces Embedding Loss, a new method that improves diffusion model distillation by enhancing quality and reducing training time, especially for resource-limited settings.
Contribution
The paper proposes Embedding Loss, a novel feature embedding-based loss function that boosts diffusion model distillation efficiency and effectiveness across multiple datasets and frameworks.
Findings
Achieves state-of-the-art FID of 1.475 on CIFAR-10 for unconditional generation.
Reduces training iterations by up to 80%.
Demonstrates consistent improvements across datasets like ImageNet, AFHQ-v2, and FFHQ.
Abstract
Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limiting accessibility for resource-constrained researchers, and existing supplementary loss functions have notable limitations. Regression loss requires pre-generating large datasets before training and limits the student model to the teacher's performance, while GAN-based losses suffer from training instability and require careful tuning. In this paper, we propose Embedding Loss (EL), a novel supplementary loss function that complements existing diffusion distillation methods to enhance generation quality and accelerate training with smaller batch sizes. Leveraging feature embeddings from a diverse set of randomly initialized networks, EL effectively aligns the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
