A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer
Jinxiang Liu, Yangheng Zhao, Siheng Chen, Ya Zhang

TL;DR
This paper introduces a 3D mesh-based lifting-and-projection network for human pose transfer, effectively handling occlusions and preserving details by leveraging 3D body shape priors and novel neural modules.
Contribution
It proposes a novel 3D mesh-based framework with LPNet and ADCNet that improves pose transfer quality over existing 2D and mesh-based methods.
Findings
Outperforms existing methods on iPER and Fashion datasets.
Effectively handles occlusions and preserves texture details.
Demonstrates superior results in both self-transfer and cross-transfer settings.
Abstract
Human pose transfer has typically been modeled as a 2D image-to-image translation problem. This formulation ignores the human body shape prior in 3D space and inevitably causes implausible artifacts, especially when facing occlusion. To address this issue, we propose a lifting-and-projection framework to perform pose transfer in the 3D mesh space. The core of our framework is a foreground generation module, that consists of two novel networks: a lifting-and-projection network (LPNet) and an appearance detail compensating network (ADCNet). To leverage the human body shape prior, LPNet exploits the topological information of the body mesh to learn an expressive visual representation for the target person in the 3D mesh space. To preserve texture details, ADCNet is further introduced to enhance the feature produced by LPNet with the source foreground image. Such design of the foreground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
