Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance
Liangyu Yuan, Yufei Huang, Mingkun Lei, Tong Zhao, Ruoyu Wang, Changxi Chi, Yiwei Wang, Chi Zhang

TL;DR
This paper introduces a hybrid guidance method called SGG based on the weak-to-strong principle, improving diffusion model generalization during sampling and training, validated through extensive experiments.
Contribution
The paper proposes SGG, a novel hybrid guidance technique for diffusion models, and demonstrates its effectiveness in both inference and training settings, enhancing generalization.
Findings
SGG outperforms existing guidance methods at inference.
Training with W2S principle improves diffusion model generalization.
SGG achieves performance gains in both conditional and unconditional tasks.
Abstract
Diffusion models generate synthetic images through an iterative refinement process. However, the misalignment between the simulation-free objective and the iterative process often causes accumulated gradient error along the sampling trajectory, which leads to unsatisfactory results and a failure to generalize. Guidance techniques like Classifier Free Guidance (CFG) and AutoGuidance (AG) alleviate this by extrapolating between the main and inferior signal for stronger generalization. Despite empirical success, the effective operational regimes of prevalent guidance methods are still under-explored, leading to ambiguity when selecting the appropriate guidance method given a precondition. In this work, we first conduct synthetic comparisons to isolate and demonstrate the effective regime of guidance methods represented by CFG and AG from the perspective of weak-to-strong principle. Based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · 3D Shape Modeling and Analysis
