Consistency Models
Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

TL;DR
Consistency models are a new type of generative model that enable fast, high-quality data generation and editing by directly mapping noise to data, outperforming previous diffusion-based methods in speed and quality.
Contribution
This paper introduces consistency models, a novel family of models that support one-step generation and zero-shot editing, trained via distillation or as standalone models, surpassing existing methods in benchmarks.
Findings
Achieve state-of-the-art FID of 3.55 on CIFAR-10 with one-step generation.
Outperform existing diffusion distillation techniques in one- and few-step sampling.
Can serve as standalone generative models surpassing traditional non-adversarial models.
Abstract
Diffusion models have significantly advanced the fields of image, audio, and video generation, but they depend on an iterative sampling process that causes slow generation. To overcome this limitation, we propose consistency models, a new family of models that generate high quality samples by directly mapping noise to data. They support fast one-step generation by design, while still allowing multistep sampling to trade compute for sample quality. They also support zero-shot data editing, such as image inpainting, colorization, and super-resolution, without requiring explicit training on these tasks. Consistency models can be trained either by distilling pre-trained diffusion models, or as standalone generative models altogether. Through extensive experiments, we demonstrate that they outperform existing distillation techniques for diffusion models in one- and few-step sampling,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗dg845/consistency-model-pipelinesmodel· 28 dl· ♡ 128 dl♡ 1
- 🤗dg845/diffusers-cd_bedroom256_l2model· 5 dl5 dl
- 🤗dg845/diffusers-cd_cat256_l2model· 2 dl2 dl
- 🤗dg845/diffusers-cd_imagenet64_lpipsmodel· 6 dl6 dl
- 🤗dg845/diffusers-ct_imagenet64model· 1 dl1 dl
- 🤗dg845/diffusers-cd_bedroom256_lpipsmodel· 1 dl1 dl
- 🤗dg845/diffusers-ct_bedroom256model· 3 dl3 dl
- 🤗dg845/diffusers-cd_cat256_lpipsmodel· 5 dl5 dl
- 🤗dg845/diffusers-ct_cat256model· 5 dl5 dl
- 🤗openai/diffusers-ct_cat256model· 5 dl· ♡ 35 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Cell Image Analysis Techniques
MethodsConsistency Models · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
