Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees

Nishant Jain; Xunpeng Huang; Yian Ma; Tong Zhang

arXiv:2505.01049·cs.LG·May 27, 2025

Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees

Nishant Jain, Xunpeng Huang, Yian Ma, Tong Zhang

PDF

Open Access

TL;DR

This paper provides a theoretical foundation for consistency models, demonstrating they can generate high-quality samples rapidly with provable guarantees, even in non-smooth data settings, surpassing existing methods.

Contribution

It offers the first theoretical analysis of multi-step consistency models, establishing convergence rates and showing their efficiency and accuracy in both smooth and non-smooth data distributions.

Findings

01

Achieves KL divergence of O(ε^2) with O(log(d/ε)) steps.

02

In non-smooth data, convergence with O(d log(d/ε)) steps.

03

Best in class convergence rates for non-smooth cases.

Abstract

Consistency models have recently emerged as a compelling alternative to traditional SDE-based diffusion models. They offer a significant acceleration in generation by producing high-quality samples in very few steps. Despite their empirical success, a proper theoretic justification for their speed-up is still lacking. In this work, we address the gap by providing a theoretical analysis of consistency models capable of mapping inputs at a given time to arbitrary points along the reverse trajectory. We show that one can achieve a KL divergence of order $O (ε^{2})$ using only $O (lo g (\frac{d}{ε}))$ iterations with a constant step size. Additionally, under minimal assumptions on the data distribution (non smooth case) an increasingly common setting in recent diffusion model analyses we show that a similar KL convergence guarantee can be obtained,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications

MethodsConsistency Models · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Network On Network