Implicit Preference Alignment for Human Image Animation

Yuanzhi Wang; Xuhua Ren; Jiaxiang Cheng; Bing Ma; Kai Yu; Tianxiang Zheng; Qinglin Lu; Zhen Cui

arXiv:2605.07545·cs.CV·May 11, 2026

Implicit Preference Alignment for Human Image Animation

Yuanzhi Wang, Xuhua Ren, Jiaxiang Cheng, Bing Ma, Kai Yu, Tianxiang Zheng, Qinglin Lu, Zhen Cui

PDF

1 Repo

TL;DR

This paper introduces Implicit Preference Alignment (IPA), a data-efficient framework for improving human image animation, especially hand motions, without requiring preference pair data, by leveraging implicit reward maximization.

Contribution

The paper proposes a novel, data-efficient post-training method that aligns models using implicit reward maximization, reducing the need for costly preference data in human image animation.

Findings

01

Effective enhancement of hand motion quality in image animation.

02

Significant reduction in preference data requirements.

03

Successful implementation of Hand-Aware Local Optimization.

Abstract

Human image animation has witnessed significant advancements, yet generating high-fidelity hand motions remains a persistent challenge due to their high degrees of freedom and motion complexity. While reinforcement learning from human feedback, particularly direct preference optimization, offers a potential solution, it necessitates the construction of strict preference pairs. However, curating such pairs for dynamic hand regions is prohibitively expensive and often impractical due to frame-wise inconsistencies. In this paper, we propose Implicit Preference Alignment (IPA), a data-efficient post-training framework that eliminates the need for paired preference data. Theoretically grounded in implicit reward maximization, IPA aligns the model by maximizing the likelihood of self-generated high-quality samples while penalizing deviations from the pretrained prior. Furthermore, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mdswyz/IPA
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.