Post-Hoc Guidance for Consistency Models by Joint Flow Distribution Learning
Chia-Hong Hsu, Randall Balestriero

TL;DR
This paper introduces JFDL, a lightweight method that enables effective post-hoc guidance for pre-trained Consistency Models, matching the guidance flexibility of Diffusion Models without requiring a separate teacher model.
Contribution
JFDL allows guidance in pre-trained CMs without distillation, unlocking guided generation and reducing FID scores on CIFAR-10 and ImageNet datasets.
Findings
JFDL provides an adjustable guidance knob for CMs.
Guided images from JFDL have similar characteristics to CFG.
JFDL reduces FID scores on CIFAR-10 and ImageNet 64x64 datasets.
Abstract
Classifier-free Guidance (CFG) lets practitioners trade-off fidelity against diversity in Diffusion Models (DMs). The practicality of CFG is however hindered by DMs sampling cost. On the other hand, Consistency Models (CMs) generate images in one or a few steps, but existing guidance methods require knowledge distillation from a separate DM teacher, limiting CFG to Consistency Distillation (CD) methods. We propose Joint Flow Distribution Learning (JFDL), a lightweight alignment method enabling guidance in a pre-trained CM. With a pre-trained CM as an ordinary differential equation (ODE) solver, we verify with normality tests that the variance-exploding noise implied by the velocity fields from unconditional and conditional distributions is Gaussian. In practice, JFDL equips CMs with the familiar adjustable guidance knob, yielding guided images with similar characteristics to CFG.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
