Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Zichen Miao, Zhengyuan Yang, Kevin Lin, Ze Wang, Zicheng Liu, Lijuan, Wang, Qiang Qiu

TL;DR
This paper introduces pairwise sample optimization (PSO), a novel algorithm for fine-tuning timestep-distilled diffusion models, enabling improved image generation quality while maintaining low inference steps and computational efficiency.
Contribution
PSO provides a direct fine-tuning method for timestep-distilled diffusion models, enhancing their adaptability to preferences, styles, and concepts without extensive distillation retraining.
Findings
PSO improves human-preferred image generation.
PSO effectively adapts models for style transfer.
PSO maintains few-step generation capabilities.
Abstract
Recent advancements in timestep-distilled diffusion models have enabled high-quality image generation that rivals non-distilled multi-step models, but with significantly fewer inference steps. While such models are attractive for applications due to the low inference cost and latency, fine-tuning them with a naive diffusion objective would result in degraded and blurry outputs. An intuitive alternative is to repeat the diffusion distillation process with a fine-tuned teacher model, which produces good results but is cumbersome and computationally intensive; the distillation training usually requires magnitude higher of training compute compared to fine-tuning for specific image styles. In this paper, we present an algorithm named pairwise sample optimization (PSO), which enables the direct fine-tuning of an arbitrary timestep-distilled diffusion model. PSO introduces additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
