REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer

Quanwei Yang; Xinchen Liu; Wu Liu; Hongtao Xie; Xiaoyan Gu; Lingyun; Yu; Yongdong Zhang

arXiv:2209.00475·cs.CV·September 2, 2022

REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer

Quanwei Yang, Xinchen Liu, Wu Liu, Hongtao Xie, Xiaoyan Gu, Lingyun, Yu, Yongdong Zhang

PDF

Open Access

TL;DR

REMOT introduces a novel GAN-based framework for human motion transfer that generates realistic videos by progressively assembling body parts and aligning global appearance and textures, outperforming existing methods.

Contribution

The paper proposes REMOT, a region-to-whole framework with modules for global and texture alignment, improving realism and reducing artifacts in human motion transfer.

Findings

01

Achieves state-of-the-art results on public benchmarks.

02

Effectively reduces artifacts caused by pose and scale differences.

03

Demonstrates superior visual quality in generated videos.

Abstract

Human Video Motion Transfer (HVMT) aims to, given an image of a source person, generate his/her video that imitates the motion of the driving person. Existing methods for HVMT mainly exploit Generative Adversarial Networks (GANs) to perform the warping operation based on the flow estimated from the source person image and each driving video frame. However, these methods always generate obvious artifacts due to the dramatic differences in poses, scales, and shifts between the source person and the driving person. To overcome these challenges, this paper presents a novel REgionto-whole human MOtion Transfer (REMOT) framework based on GANs. To generate realistic motions, the REMOT adopts a progressive generation paradigm: it first generates each body part in the driving pose without flow-based warping, then composites all parts into a complete person of the driving motion. Moreover, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Advanced Vision and Imaging

MethodsALIGN