Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample   Optimization

Zichen Miao; Zhengyuan Yang; Kevin Lin; Ze Wang; Zicheng Liu; Lijuan; Wang; Qiang Qiu

arXiv:2410.03190·cs.CV·March 4, 2025

Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization

Zichen Miao, Zhengyuan Yang, Kevin Lin, Ze Wang, Zicheng Liu, Lijuan, Wang, Qiang Qiu

PDF

Open Access

TL;DR

This paper introduces pairwise sample optimization (PSO), a novel algorithm for fine-tuning timestep-distilled diffusion models, enabling improved image generation quality while maintaining low inference steps and computational efficiency.

Contribution

PSO provides a direct fine-tuning method for timestep-distilled diffusion models, enhancing their adaptability to preferences, styles, and concepts without extensive distillation retraining.

Findings

01

PSO improves human-preferred image generation.

02

PSO effectively adapts models for style transfer.

03

PSO maintains few-step generation capabilities.

Abstract

Recent advancements in timestep-distilled diffusion models have enabled high-quality image generation that rivals non-distilled multi-step models, but with significantly fewer inference steps. While such models are attractive for applications due to the low inference cost and latency, fine-tuning them with a naive diffusion objective would result in degraded and blurry outputs. An intuitive alternative is to repeat the diffusion distillation process with a fine-tuned teacher model, which produces good results but is cumbersome and computationally intensive; the distillation training usually requires magnitude higher of training compute compared to fine-tuning for specific image styles. In this paper, we present an algorithm named pairwise sample optimization (PSO), which enables the direct fine-tuning of an arbitrary timestep-distilled diffusion model. PSO introduces additional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion