Distribution Backtracking Builds A Faster Convergence Trajectory for Diffusion Distillation
Shengyuan Zhang, Ling Yang, Zejian Li, An Zhao, Chenye Meng, Changyuan, Yang, Guang Yang, Zhiyuan Yang, Lingyun Sun

TL;DR
This paper introduces Distribution Backtracking Distillation (DisBack), a novel method that leverages the entire convergence trajectory of teacher diffusion models to accelerate and improve the distillation process for faster image generation.
Contribution
DisBack extends score distillation by incorporating the convergence trajectory, enabling faster and more effective diffusion model distillation with comparable quality.
Findings
DisBack achieves faster convergence than existing methods.
DisBack attains a low FID score of 1.38 on ImageNet 64x64.
The method is easy to implement and generalizes well to other distillation approaches.
Abstract
Accelerating the sampling speed of diffusion models remains a significant challenge. Recent score distillation methods distill a heavy teacher model into a student generator to achieve one-step generation, which is optimized by calculating the difference between the two score functions on the samples generated by the student model. However, there is a score mismatch issue in the early stage of the distillation process, because existing methods mainly focus on using the endpoint of pre-trained diffusion models as teacher models, overlooking the importance of the convergence trajectory between the student generator and the teacher model. To address this issue, we extend the score distillation process by introducing the entire convergence trajectory of teacher models and propose Distribution Backtracking Distillation (DisBack). DisBask is composed of two stages: Degradation Recording and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProcess Optimization and Integration · Advanced Control Systems Optimization
MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus
