Three Creates All: You Only Sample 3 Steps

Yuren Cai; Guangyi Wang; Zongqing Li; Li Li; Zhihui Liu; and Songzhi Su

arXiv:2603.22375·cs.LG·March 25, 2026

Three Creates All: You Only Sample 3 Steps

Yuren Cai, Guangyi Wang, Zongqing Li, Li Li, Zhihui Liu, and Songzhi Su

PDF

Open Access

TL;DR

This paper introduces MTEO, a method that distills layer-wise time embeddings to enable fast, high-quality diffusion sampling with only three steps, without increasing inference time.

Contribution

MTEO is a novel, plug-and-play approach that distills small, layer-specific time embeddings, significantly improving few-step diffusion sampling performance.

Findings

01

Achieves state-of-the-art results in few-step sampling.

02

Narrowed the gap between distillation and lightweight methods.

03

No additional inference overhead introduced.

Abstract

Diffusion models deliver high-fidelity generation but remain slow at inference time due to many sequential network evaluations. We find that standard timestep conditioning becomes a key bottleneck for few-step sampling. Motivated by layer-dependent denoising dynamics, we propose Multi-layer Time Embedding Optimization (MTEO), which freeze the pretrained diffusion backbone and distill a small set of step-wise, layer-wise time embeddings from reference trajectories. MTEO is plug-and-play with existing ODE solvers, adds no inference-time overhead, and trains only a tiny fraction of parameters. Extensive experiments across diverse datasets and backbones show state-of-the-art performance in the few-step sampling and substantially narrow the gap between distillation-based and lightweight methods. Code will be available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare · Domain Adaptation and Few-Shot Learning