Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees
Nishant Jain, Xunpeng Huang, Yian Ma, Tong Zhang

TL;DR
This paper provides a theoretical foundation for consistency models, demonstrating they can generate high-quality samples rapidly with provable guarantees, even in non-smooth data settings, surpassing existing methods.
Contribution
It offers the first theoretical analysis of multi-step consistency models, establishing convergence rates and showing their efficiency and accuracy in both smooth and non-smooth data distributions.
Findings
Achieves KL divergence of O(ε^2) with O(log(d/ε)) steps.
In non-smooth data, convergence with O(d log(d/ε)) steps.
Best in class convergence rates for non-smooth cases.
Abstract
Consistency models have recently emerged as a compelling alternative to traditional SDE-based diffusion models. They offer a significant acceleration in generation by producing high-quality samples in very few steps. Despite their empirical success, a proper theoretic justification for their speed-up is still lacking. In this work, we address the gap by providing a theoretical analysis of consistency models capable of mapping inputs at a given time to arbitrary points along the reverse trajectory. We show that one can achieve a KL divergence of order using only iterations with a constant step size. Additionally, under minimal assumptions on the data distribution (non smooth case) an increasingly common setting in recent diffusion model analyses we show that a similar KL convergence guarantee can be obtained,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
MethodsConsistency Models · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Network On Network
