TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

Minghao Fu; Guo-Hua Wang; Xiaohao Chen; Qing-Guo Chen; Zhao Xu; Weihua Luo; Kaifu Zhang

arXiv:2507.18192·cs.CV·July 28, 2025

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

Minghao Fu, Guo-Hua Wang, Xiaohao Chen, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang

PDF

Open Access 1 Models

TL;DR

TeEFusion is a distillation method that efficiently mimics complex guidance strategies in text-to-image models, enabling faster inference without sacrificing image quality.

Contribution

It introduces a simple fusion of text embeddings to replicate guidance effects, reducing inference costs in text-to-image synthesis.

Findings

01

Student models achieve up to 6x faster inference

02

Maintains comparable image quality to teacher models

03

Effective distillation of complex sampling strategies

Abstract

Recent advances in text-to-image synthesis largely benefit from sophisticated sampling strategies and classifier-free guidance (CFG) to ensure high-quality generation. However, CFG's reliance on two forward passes, especially when combined with intricate sampling algorithms, results in prohibitively high inference costs. To address this, we introduce TeEFusion (Text Embeddings Fusion), a novel and efficient distillation method that directly incorporates the guidance magnitude into the text embeddings and distills the teacher model's complex sampling strategy. By simply fusing conditional and unconditional text embeddings using linear operations, TeEFusion reconstructs the desired guidance without adding extra parameters, simultaneously enabling the student model to learn from the teacher's output produced via its sophisticated sampling approach. Extensive experiments on state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
AIDC-AI/TeEFusion
model· 5 dl· ♡ 3
5 dl♡ 3

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning