Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Zhanhao Liang, Yuhui Yuan, Shuyang Gu, Bohan Chen, Tiankai Hang,, Mingxi Cheng, Ji Li, Liang Zheng

TL;DR
This paper introduces step-by-step preference optimization (SPO) for diffusion models, enhancing aesthetic quality by focusing on fine-grained visual differences without sacrificing alignment, and demonstrating faster convergence and superior aesthetics.
Contribution
The paper proposes a novel SPO method that improves aesthetic generation in diffusion models by using step-aware preferences and fine-grained supervision, outperforming existing DPO techniques.
Findings
SPO significantly improves aesthetic quality over existing DPO methods.
SPO converges faster than traditional DPO due to better preference labeling.
Models fine-tuned with SPO maintain strong image-text alignment.
Abstract
Generating visually appealing images is fundamental to modern text-to-image generation models. A potential solution to better aesthetics is direct preference optimization (DPO), which has been applied to diffusion models to improve general image quality including prompt alignment and aesthetics. Popular DPO methods propagate preference labels from clean image pairs to all the intermediate steps along the two generation trajectories. However, preference labels provided in existing datasets are blended with layout and aesthetic opinions, which would disagree with aesthetic preference. Even if aesthetic labels were provided (at substantial cost), it would be hard for the two-trajectory methods to capture nuanced visual differences at different steps. To improve aesthetics economically, this paper uses existing generic preference data and introduces step-by-step preference optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗SPO-Diffusion-Models/SPO-SDXL_4k-p_10epmodel· 20 dl· ♡ 6120 dl♡ 61
- 🤗SPO-Diffusion-Models/SPO-SDXL_4k-p_10ep_LoRAmodel· ♡ 24♡ 24
- 🤗SPO-Diffusion-Models/SPO-SD-v1-5_4k-p_10epmodel· 27 dl· ♡ 627 dl♡ 6
- 🤗SPO-Diffusion-Models/SPO-SD-v1-5_4k-p_10ep_LoRAmodel· ♡ 10♡ 10
- 🤗SPO-Diffusion-Models/Step-Aware_Preference_Modelsmodel· ♡ 3♡ 3
- 🤗LyliaEngine/spo_sdxl_10ep_4k-data_lora_webuimodel· 183 dl183 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUrban Design and Spatial Analysis · Urban Planning and Valuation
MethodsFocus · Direct Preference Optimization · Diffusion
