Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Yifan Pu; Yizeng Han; Zhiwei Tang; Jiasheng Tang; Fan Wang; Bohan Zhuang; Gao Huang

arXiv:2512.13006·cs.CV·December 16, 2025

Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Yifan Pu, Yizeng Han, Zhiwei Tang, Jiasheng Tang, Fan Wang, Bohan Zhuang, Gao Huang

PDF

Open Access 1 Models

TL;DR

This paper systematically studies and adapts diffusion distillation techniques for open-ended text-to-image generation, providing practical guidelines and open-source tools to enable fast, high-quality, resource-efficient T2I models.

Contribution

It is the first comprehensive analysis of diffusion distillation methods applied to open-ended T2I generation, with practical recommendations and pretrained models.

Findings

01

Identified key obstacles in applying diffusion distillation to T2I.

02

Provided practical guidelines for input scaling, architecture, and hyperparameters.

03

Established a foundation for deploying efficient T2I diffusion models.

Abstract

Diffusion distillation has dramatically accelerated class-conditional image synthesis, but its applicability to open-ended text-to-image (T2I) generation is still unclear. We present the first systematic study that adapts and compares state-of-the-art distillation techniques on a strong T2I teacher model, FLUX.1-lite. By casting existing methods into a unified framework, we identify the key obstacles that arise when moving from discrete class labels to free-form language prompts. Beyond a thorough methodological analysis, we offer practical guidelines on input scaling, network architecture, and hyperparameters, accompanied by an open-source implementation and pretrained student models. Our findings establish a solid foundation for deploying fast, high-fidelity, and resource-efficient diffusion generators in real-world T2I applications. Code is available on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Alibaba-DAMO-Academy/T2I-Distill
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Neural Network Applications