Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation
Xiao He, Yang Li, Peizhen Zhang, Songtao Liu, Zhao Zhong, Nannan Wang

TL;DR
This paper proposes a stabilized and scalable MeanFlow distillation method for large-scale diffusion models, improving stability and performance in text-to-image generation tasks.
Contribution
It introduces a warm-up technique and trajectory distribution alignment to enhance MeanFlow's stability and effectiveness for large-scale diffusion model distillation.
Findings
Achieves superior distillation performance on 12B-parameter models.
Demonstrates robust generalization on 80B-parameter models.
Improves stability and reduces bias in diffusion model distillation.
Abstract
Diffusion models exhibit remarkable generative capability, but their high latency limits practical deployment. Many studies have attempted to reduce sampling steps to accelerate inference. Among them, MeanFlow has attracted considerable attention due to its concise formulation and remarkable performance. Nevertheless, the instability of its optimization objective and the ''mean-seeking bias'' have limited its applicability to distill large-scale industrial models. To stabilize MeanFlow for distilling large-scale models, we first introduce a warm-up technique, in which the original differential solution of MeanFlow is replaced by a discrete solution. This design avoids training collapse caused by the MeanFlow target containing a stop-gradient term from an undertrained model. Once the model acquires a preliminary ability to fit the average velocity field, we switch the optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
