Correspondence Learning for Controllable Person Image Generation
Shilong Shen

TL;DR
This paper introduces a generative model for controllable person image synthesis that accurately transfers pose and clothing attributes by establishing dense correspondence between source and target images, resulting in high-quality, controllable person images.
Contribution
The paper proposes a novel dense correspondence-based framework that improves pose and clothing-guided person image generation with explicit structural constraints and attribute decomposition.
Findings
Outperforms state-of-the-art in pose-guided person generation
Effective in clothing-guided person image synthesis
Generates high-quality, structurally consistent images
Abstract
We present a generative model for controllable person image synthesis,as shown in Figure , which can be applied to pose-guided person image synthesis, , converting the pose of a source person image to the target pose while preserving the texture of that source person image, and clothing-guided person image synthesis, , changing the clothing texture of a source person image to the desired clothing texture. By explicitly establishing the dense correspondence between the target pose and the source image, we can effectively address the misalignment introduced by pose tranfer and generate high-quality images. Specifically, we first generate the target semantic map under the guidence of the target pose, which can provide more accurate pose representation and structural constraints during the generation process. Then, decomposed attribute encoder is used to extract the component…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Advanced Vision and Imaging
