Accelerating Diffusion Models with One-to-Many Knowledge Distillation
Linfeng Zhang, Kaisheng Ma

TL;DR
This paper introduces one-to-many knowledge distillation (O2MKD), a novel method to accelerate diffusion models by training multiple student models on different timestep subsets, significantly reducing computational costs for real-time image generation.
Contribution
The paper proposes a new distillation approach that trains multiple student diffusion models on different timestep subsets, enhancing speed without sacrificing quality.
Findings
O2MKD achieves significant acceleration on multiple datasets.
The method improves efficiency of existing knowledge distillation and sampling techniques.
Experimental results demonstrate effective speedup with maintained image quality.
Abstract
Significant advancements in image generation have been made with diffusion models. Nevertheless, when contrasted with previous generative models, diffusion models face substantial computational overhead, leading to failure in real-time generation. Recent approaches have aimed to accelerate diffusion models by reducing the number of sampling steps through improved sampling techniques or step distillation. However, the methods to diminish the computational cost for each timestep remain a relatively unexplored area. Observing the fact that diffusion models exhibit varying input distributions and feature distributions at different timesteps, we introduce one-to-many knowledge distillation (O2MKD), which distills a single teacher diffusion model into multiple student diffusion models, where each student diffusion model is trained to learn the teacher's knowledge for a subset of continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsKnowledge Distillation · Diffusion
