Self-Guidance: Boosting Flow and Diffusion Generation on Their Own
Tiancheng Li, Weijian Luo, Zhiyang Chen, Liyuan Ma, Guo-Jun Qi

TL;DR
This paper introduces Self-Guidance (SG), a flexible, guidance-only method that enhances diffusion and flow-based image generation quality without retraining, effectively suppressing low-quality outputs and improving physiological accuracy.
Contribution
The paper proposes a novel guidance technique, Self-Guidance, that improves generation quality using only the model's score function at different noise levels, without additional training.
Findings
SG outperforms existing algorithms on FID and Human Preference Score.
SG-prev achieves 50% more efficiency than SG.
Both methods improve physiological correctness of generated human body parts.
Abstract
Proper guidance strategies are essential to achieve high-quality generation results without retraining diffusion and flow-based text-to-image models. Existing guidance either requires specific training or strong inductive biases of diffusion model networks, which potentially limits their ability and application scope. Motivated by the observation that artifact outliers can be detected by a significant decline in the density from a noisier to a cleaner noise level, we propose Self-Guidance (SG), which can significantly improve the quality of the generated image by suppressing the generation of low-quality samples. The biggest difference from existing guidance is that SG only relies on the sampling score function of the original diffusion or flow model at different noise levels, with no need for any tricky and expensive guidance-specific training. This makes SG highly flexible to be used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
MethodsDiffusion
