Boost Your Human Image Generation Model via Direct Preference Optimization

Sanghyeon Na; Yonggyu Kim; Hyunjoon Lee

arXiv:2405.20216 · cs.CV · April 10, 2025

TL;DR

This paper introduces HG-DPO, an improved direct preference optimization method that uses real images and curriculum learning to enhance the realism and personalization of human image generation models.

Contribution

The paper proposes HG-DPO, a novel DPO framework incorporating real images and curriculum learning to significantly improve human image synthesis quality and personalization.

Findings

01. Enhanced realism in generated human images.

02. Effective personalization for identity-specific image generation.

03. Improved training stability and convergence.

Abstract

Human image generation is a key focus in image synthesis due to its broad applications, but even slight inaccuracies in anatomy, pose, or details can compromise realism. To address these challenges, we explore Direct Preference Optimization (DPO), which trains models to generate preferred (winning) images while diverging from non-preferred (losing) ones. However, conventional DPO methods use generated images as winning images, limiting realism. To overcome this limitation, we propose an enhanced DPO approach that incorporates high-quality real images as winning images, encouraging outputs to resemble real images rather than generated ones. However, implementing this concept is not a trivial task. Therefore, our approach, HG-DPO (Human image Generation through DPO), employs a novel curriculum learning framework that gradually improves the output of the model toward greater realism,…
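
To make the preference objective described in the abstract concrete, here is a minimal sketch of a generic DPO-style loss for a diffusion model. It is not taken from the paper: the function name `dpo_preference_loss`, its arguments, and the scale `beta` are hypothetical, and the exact HG-DPO objective and curriculum stages are not specified in the text above. The sketch assumes each image is summarized by a scalar denoising error under the trained model and under a frozen reference model. In the variant the abstract proposes, the winning-side errors would be measured on a real image rather than a generated one.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_preference_loss(
    err_win_model: float,
    err_win_ref: float,
    err_lose_model: float,
    err_lose_ref: float,
    beta: float = 1.0,  # hypothetical preference strength, not from the paper
) -> float:
    """Generic DPO-style loss on one (winning, losing) image pair.

    Each argument is a denoising error (lower is better) for the
    winning or losing image, under the trained or the reference model.
    """
    # How much the trained model improves on the reference for each image.
    win_gain = err_win_ref - err_win_model
    lose_gain = err_lose_ref - err_lose_model
    # The loss rewards improving more on the winning image than on the losing one.
    margin = beta * (win_gain - lose_gain)
    # -log(sigmoid(margin)) is the same as softplus(-margin).
    return softplus(-margin)


if __name__ == "__main__":
    # Trained model fits the winning image better and the losing image worse
    # than the reference, so the loss drops below log(2).
    print(dpo_preference_loss(0.5, 1.0, 1.5, 1.0))
```

When the trained model matches the reference on both images the margin is zero and the loss equals log 2; it decreases as the model moves toward the winning image and away from the losing one, which is the behavior the abstract describes as generating preferred images while diverging from non-preferred ones.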

Taxonomy

Topics: Human Pose and Action Recognition · Human Motion and Animation

Methods: Focus · Direct Preference Optimization · Diffusion