TLCM: Training-efficient Latent Consistency Model for Image Generation   with 2-8 Steps

Qingsong Xie; Zhenyi Liao; Zhijie Deng; Chen chen; Haonan Lu

arXiv:2406.05768·cs.CV·November 8, 2024

TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps

Qingsong Xie, Zhenyi Liao, Zhijie Deng, Chen chen, Haonan Lu

PDF

Open Access 1 Repo 1 Models

TL;DR

TLCM is a training-efficient, data-free latent consistency model that accelerates image generation with 2-8 inference steps, maintaining high quality and flexibility, and surpassing existing models in various benchmarks.

Contribution

The paper introduces a novel data-free, multi-step latent consistency distillation method that significantly reduces training time and inference steps while preserving image quality.

Findings

01

Achieves high-quality image generation with only 2-8 steps.

02

Surpasses existing accelerated models in CLIP and aesthetic scores.

03

Demonstrates versatility in style transfer and controllable generation.

Abstract

Distilling latent diffusion models (LDMs) into ones that are fast to sample from is attracting growing research interest. However, the majority of existing methods face two critical challenges: (1) They hinge on long training using a huge volume of real data. (2) They routinely lead to quality degradation for generation, especially in text-image alignment. This paper proposes a novel training-efficient Latent Consistency Model (TLCM) to overcome these challenges. Our method first accelerates LDMs via data-free multistep latent consistency distillation (MLCD), and then data-free latent consistency distillation is proposed to efficiently guarantee the inter-segment consistency in MLCD. Furthermore, we introduce bags of techniques, e.g., distribution matching, adversarial learning, and preference learning, to enhance TLCM's performance at few-step inference without any real data. TLCM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oppo-mente-lab/tlcm
pytorchOfficial

Models

🤗
OPPOer/TLCM
model· ♡ 3
♡ 3

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsContrastive Language-Image Pre-training · Diffusion