Pose Guided Human Image Synthesis with Partially Decoupled GAN
Jianhan Wu, Jianzong Wang, Shijing Si, Xiaoyang Qu, Jing, Xiao

TL;DR
This paper introduces a novel pose-guided human image synthesis method that decouples body parts and employs multi-head attention to improve detail preservation and long-range dependency modeling, outperforming existing methods.
Contribution
The proposed method decouples human body parts and integrates a multi-head attention module, enhancing detail preservation and long-range modeling in pose transfer tasks.
Findings
Outperforms state-of-the-art methods on Market-1501 and DeepFashion datasets.
Effectively preserves detailed textures of human images during pose transfer.
Utilizes attention mechanisms to better model long-range dependencies.
Abstract
Pose Guided Human Image Synthesis (PGHIS) is a challenging task of transforming a human image from the reference pose to a target pose while preserving its style. Most existing methods encode the texture of the whole reference human image into a latent space, and then utilize a decoder to synthesize the image texture of the target pose. However, it is difficult to recover the detailed texture of the whole human image. To alleviate this problem, we propose a method by decoupling the human body into several parts (\eg, hair, face, hands, feet, \etc) and then using each of these parts to guide the synthesis of a realistic image of the person, which preserves the detailed information of the generated images. In addition, we design a multi-head attention-based module for PGHIS. Because most convolutional neural network-based methods have difficulty in modeling long-range dependency due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Advanced Vision and Imaging
