Consistency Models

Yang Song; Prafulla Dhariwal; Mark Chen; Ilya Sutskever

arXiv:2303.01469·cs.LG·June 1, 2023·24 cites

Consistency Models

Yang Song, Prafulla Dhariwal, Mark Chen, Ilya Sutskever

PDF

Open Access 5 Repos 10 Models

TL;DR

Consistency models are a new type of generative model that enable fast, high-quality data generation and editing by directly mapping noise to data, outperforming previous diffusion-based methods in speed and quality.

Contribution

This paper introduces consistency models, a novel family of models that support one-step generation and zero-shot editing, trained via distillation or as standalone models, surpassing existing methods in benchmarks.

Findings

01

Achieve state-of-the-art FID of 3.55 on CIFAR-10 with one-step generation.

02

Outperform existing diffusion distillation techniques in one- and few-step sampling.

03

Can serve as standalone generative models surpassing traditional non-adversarial models.

Abstract

Diffusion models have significantly advanced the fields of image, audio, and video generation, but they depend on an iterative sampling process that causes slow generation. To overcome this limitation, we propose consistency models, a new family of models that generate high quality samples by directly mapping noise to data. They support fast one-step generation by design, while still allowing multistep sampling to trade compute for sample quality. They also support zero-shot data editing, such as image inpainting, colorization, and super-resolution, without requiring explicit training on these tasks. Consistency models can be trained either by distilling pre-trained diffusion models, or as standalone generative models altogether. Through extensive experiments, we demonstrate that they outperform existing distillation techniques for diffusion models in one- and few-step sampling,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Cell Image Analysis Techniques

MethodsConsistency Models · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings