Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation

Xiao He; Yang Li; Peizhen Zhang; Songtao Liu; Zhao Zhong; Nannan Wang

arXiv:2605.17834·cs.CV·May 19, 2026

Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation

Xiao He, Yang Li, Peizhen Zhang, Songtao Liu, Zhao Zhong, Nannan Wang

PDF

TL;DR

This paper proposes a stabilized and scalable MeanFlow distillation method for large-scale diffusion models, improving stability and performance in text-to-image generation tasks.

Contribution

It introduces a warm-up technique and trajectory distribution alignment to enhance MeanFlow's stability and effectiveness for large-scale diffusion model distillation.

Findings

01

Achieves superior distillation performance on 12B-parameter models.

02

Demonstrates robust generalization on 80B-parameter models.

03

Improves stability and reduces bias in diffusion model distillation.

Abstract

Diffusion models exhibit remarkable generative capability, but their high latency limits practical deployment. Many studies have attempted to reduce sampling steps to accelerate inference. Among them, MeanFlow has attracted considerable attention due to its concise formulation and remarkable performance. Nevertheless, the instability of its optimization objective and the ''mean-seeking bias'' have limited its applicability to distill large-scale industrial models. To stabilize MeanFlow for distilling large-scale models, we first introduce a warm-up technique, in which the original differential solution of MeanFlow is replaced by a discrete solution. This design avoids training collapse caused by the MeanFlow target containing a stop-gradient term from an undertrained model. Once the model acquires a preliminary ability to fit the average velocity field, we switch the optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.