Inversion-DPO: Precise and Efficient Post-Training for Diffusion Models

Zejian Li; Yize Li; Chenye Meng; Zhongni Liu; Yang Ling; Shengyuan Zhang; Guang Yang; Changyuan Yang; Zhiyuan Yang; Lingyun Sun

arXiv:2507.11554·cs.CV·August 5, 2025

Inversion-DPO: Precise and Efficient Post-Training for Diffusion Models

Zejian Li, Yize Li, Chenye Meng, Zhongni Liu, Yang Ling, Shengyuan Zhang, Guang Yang, Changyuan Yang, Zhiyuan Yang, Lingyun Sun

PDF

Open Access 2 Models

TL;DR

Inversion-DPO introduces a new post-training framework for diffusion models that improves alignment accuracy and efficiency by eliminating reward models through DDIM inversion, enabling high-quality, compositional image generation.

Contribution

It reformulates Direct Preference Optimization with DDIM inversion, providing a reward-model-free, more precise, and efficient post-training method for diffusion models.

Findings

01

Significant performance improvements over existing methods.

02

High-fidelity, compositionally coherent image generation.

03

Effective for complex structural image tasks.

Abstract

Recent advancements in diffusion models (DMs) have been propelled by alignment methods that post-train models to better conform to human preferences. However, these approaches typically require computation-intensive training of a base model and a reward model, which not only incurs substantial computational overhead but may also compromise model accuracy and training efficiency. To address these limitations, we propose Inversion-DPO, a novel alignment framework that circumvents reward modeling by reformulating Direct Preference Optimization (DPO) with DDIM inversion for DMs. Our method conducts intractable posterior sampling in Diffusion-DPO with the deterministic inversion from winning and losing samples to noise and thus derive a new post-training paradigm. This paradigm eliminates the need for auxiliary reward models or inaccurate appromixation, significantly enhancing both precision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks