Phased Consistency Models

Fu-Yun Wang; Zhaoyang Huang; Alexander William Bergman; Dazhong Shen,; Peng Gao; Michael Lingelbach; Keqiang Sun; Weikang Bian; Guanglu Song; Yu; Liu; Xiaogang Wang; Hongsheng Li

arXiv:2405.18407·cs.LG·December 5, 2024·1 cites

Phased Consistency Models

Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen,, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu, Liu, Xiaogang Wang, Hongsheng Li

PDF

Open Access 1 Repo 1 Models 1 Video

TL;DR

Phased Consistency Models (PCMs) improve high-resolution, text-conditioned image and video generation by addressing limitations of Latent Consistency Models, achieving superior multi-step refinement and competitive one-step results.

Contribution

Introduction of Phased Consistency Models that generalize and enhance Latent Consistency Models for better high-resolution, text-conditioned image and video generation.

Findings

01

PCMs outperform LCMs in 1--16 step generation.

02

PCMs achieve comparable 1-step results to state-of-the-art methods.

03

PCMs enable state-of-the-art few-step text-to-video generation.

Abstract

Consistency Models (CMs) have made significant progress in accelerating the generation of diffusion models. However, their application to high-resolution, text-conditioned image generation in the latent space remains unsatisfactory. In this paper, we identify three key flaws in the current design of Latent Consistency Models (LCMs). We investigate the reasons behind these limitations and propose Phased Consistency Models (PCMs), which generalize the design space and address the identified limitations. Our evaluations demonstrate that PCMs outperform LCMs across 1--16 step generation settings. While PCMs are specifically designed for multi-step refinement, they achieve comparable 1-step generation results to previously state-of-the-art specifically designed 1-step methods. Furthermore, we show the methodology of PCMs is versatile and applicable to video generation, enabling us to train…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

G-U-N/Phased-Consistency-Model
pytorchOfficial

Models

🤗
wangfuyun/PCM_Weights
model· 136 dl· ♡ 99
136 dl♡ 99

Videos

Phased Consistency Models· slideslive

Taxonomy

TopicsComplex Systems and Decision Making

MethodsConsistency Models · Diffusion