FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models
Changgu Chen, Libing Yang, Xiaoyan Yang, Lianggangxu Chen, Gaoqi He,, CHangbo Wang, Yang Li

TL;DR
FIND introduces a policy optimization framework to directly fine-tune the initial noise distribution in diffusion models, significantly improving prompt alignment and generation speed for image and video synthesis.
Contribution
It reformulates diffusion denoising as a one-step Markov decision process and employs policy optimization with stability mechanisms to enhance prompt consistency and efficiency.
Findings
Outperforms SOTA in text-to-image and text-to-video tasks
Achieves 10x faster generation than existing methods
Improves alignment between prompts and generated content
Abstract
In recent years, large-scale pre-trained diffusion models have demonstrated their outstanding capabilities in image and video generation tasks. However, existing models tend to produce visual objects commonly found in the training dataset, which diverges from user input prompts. The underlying reason behind the inaccurate generated results lies in the model's difficulty in sampling from specific intervals of the initial noise distribution corresponding to the prompt. Moreover, it is challenging to directly optimize the initial distribution, given that the diffusion process involves multiple denoising steps. In this paper, we introduce a Fine-tuning Initial Noise Distribution (FIND) framework with policy optimization, which unleashes the powerful potential of pre-trained diffusion networks by directly optimizing the initial distribution to align the generated contents with user-input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Stochastic processes and financial applications
MethodsDiffusion · ALIGN
