Efficient Diffusion Distillation via Embedding Loss

Jincheng Ying; Yitao Chen; Li Wenlin; Minghui Xu; Yinhao Xiao

arXiv:2604.22379·cs.CV·April 27, 2026

Efficient Diffusion Distillation via Embedding Loss

Jincheng Ying, Yitao Chen, Li Wenlin, Minghui Xu, Yinhao Xiao

PDF

TL;DR

This paper introduces Embedding Loss, a new method that improves diffusion model distillation by enhancing quality and reducing training time, especially for resource-limited settings.

Contribution

The paper proposes Embedding Loss, a novel feature embedding-based loss function that boosts diffusion model distillation efficiency and effectiveness across multiple datasets and frameworks.

Findings

01

Achieves state-of-the-art FID of 1.475 on CIFAR-10 for unconditional generation.

02

Reduces training iterations by up to 80%.

03

Demonstrates consistent improvements across datasets like ImageNet, AFHQ-v2, and FFHQ.

Abstract

Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limiting accessibility for resource-constrained researchers, and existing supplementary loss functions have notable limitations. Regression loss requires pre-generating large datasets before training and limits the student model to the teacher's performance, while GAN-based losses suffer from training instability and require careful tuning. In this paper, we propose Embedding Loss (EL), a novel supplementary loss function that complements existing diffusion distillation methods to enhance generation quality and accelerate training with smaller batch sizes. Leveraging feature embeddings from a diverse set of randomly initialized networks, EL effectively aligns the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.