On the Design of One-step Diffusion via Shortcutting Flow Paths
Haitao Lin, Peiyan Hu, Minsi Ren, Zhifeng Gao, Zhi-Ming Ma, Guolin ke, Tailin Wu, Stan Z. Li

TL;DR
This paper introduces a unified framework for designing one-step diffusion models that shortcut the diffusion process, providing theoretical insights and practical improvements, leading to state-of-the-art results on ImageNet-256x256.
Contribution
It offers a systematic design framework for shortcut diffusion models, clarifying their components and enabling principled improvements without pre-training or distillation.
Findings
Achieved FID50k of 2.85 on ImageNet-256x256 with one-step generation.
Reduced training steps to 2x while maintaining high image quality.
Provided a theoretical foundation for shortcut diffusion model design.
Abstract
Recent advances in few-step diffusion models have demonstrated their efficiency and effectiveness by shortcutting the probabilistic paths of diffusion models, especially in training one-step diffusion models from scratch (\emph{a.k.a.} shortcut models). However, their theoretical derivation and practical implementation are often closely coupled, which obscures the design space. To address this, we propose a common design framework for representative shortcut models. This framework provides theoretical justification for their validity and disentangles concrete component-level choices, thereby enabling systematic identification of improvements. With our proposed improvements, the resulting one-step model achieves a new state-of-the-art FID50k of 2.85 on ImageNet-256x256 under the classifier-free guidance setting with one step generation, and further reaches FID50k of 2.53 with 2x training…
Peer Reviews
Decision·ICLR 2026 Poster
+ The paper presents a formalization of shortcut models within a unified framework. This can provide a valuable foundation for subsequent work. + The paper elucidates the design space of SC models. Both the mathematical formulation and the overall writing quality are presented with clarity. + The paper makes several theoretical contributions that will likely benefit future research. These include: i) A Wasserstein distance bound for the objectives of discrete-time (DT-SC) and continuous-time (CT
- The empirical results indicate that the proposed plug-in velocity yields marginal performance gains, suggesting that its practical benefits over existing methods may be limited. - The experimental comparison would be strengthened by the inclusion of other state-of-the-art baselines, such as rectified flow and reflow, for a more comprehensive evaluation. - The improvement techniques presented appear to be specific to the MeanFlow architecture. The paper's impact could be broadened by explorin
- This paper provides a comprehensive framework for a series of shortcut model methods, offering effective tools for analyzing this family of approaches. - The analysis of each component is detailed and well-structured. By disentangling the individual components, the paper makes the design space considerably more transparent. Theoretical analyses are extensive and appear to be well-structured. - The proposed method achieves SOTA results on one-step ImageNet-256×256 generation, demonstrating th
- According to Figure 3, it is difficult to claim that “the convergence of FID50k during training is substantially faster with the class-consistent mini-batching technique.” More evidence is needed to support this statement. - As shown in Figure 6, at the XL scale, the proposed method achieves worse FID under 2-NFE compared to 1-NFE. This might indicate a potential scalability issue of the proposed approach. - There are a few minor typo errors. For example, in line 198, $X^\theta_{t,r}(xt)$ shou
- The paper is clearly written and easy to follow. - While the work does not introduce a fundamentally new method and offers only moderate novelty, it contributes a valuable unifying perspective on ODE-based one-step diffusion models. - The theoretical contributions are strong: Theorem 2.2 establishes error bounds for both DTSC and CTSC; Proposition 3.1 presents an insightful bias–variance analysis clarifying when CTSC outperforms DTSC; and Theorem C.7 theoretically justifies why linear paths ar
- Limited Novelty of Core Framework: While the unified view is valuable for understanding, this is already well established in previous work Flow Map Matching[1,2], which further hurt the contribution of "propose a common design framework for representative shortcut models". -from discrete to continuous, one can either set $s= t−dt$ to get backward formula such as MeanFlow and sCM; or set $s=r+dt$ to obtain the forward formula such as AlighYourFlow [3]. - Empirical Gaps: - Slow convergen
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques
