Correspondence Learning for Controllable Person Image Generation

Shilong Shen

arXiv:2012.12440 · cs.CV · December 24, 2020 · 1 citation

TL;DR

This paper introduces a generative model for controllable person image synthesis. It transfers pose and clothing attributes by establishing dense correspondence between the target pose and the source image, which addresses the misalignment introduced by pose transfer.

Contribution

The paper proposes a novel dense correspondence-based framework that improves pose and clothing-guided person image generation with explicit structural constraints and attribute decomposition.

Findings

01 Outperforms state-of-the-art in pose-guided person generation

02 Effective in clothing-guided person image synthesis

03 Generates high-quality, structurally consistent images

Abstract

We present a generative model for controllable person image synthesis, as shown in Figure , which can be applied to pose-guided person image synthesis, i.e., converting the pose of a source person image to the target pose while preserving the texture of that source person image, and clothing-guided person image synthesis, i.e., changing the clothing texture of a source person image to the desired clothing texture. By explicitly establishing the dense correspondence between the target pose and the source image, we can effectively address the misalignment introduced by pose transfer and generate high-quality images. Specifically, we first generate the target semantic map under the guidance of the target pose, which can provide a more accurate pose representation and structural constraints during the generation process. Then, a decomposed attribute encoder is used to extract the component…
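The idea of dense correspondence between a target representation and a source image can be illustrated with a generic soft-correspondence warp. The sketch below is not the paper's implementation: it assumes features have already been extracted and flattened to one vector per spatial position, and it uses a common attention-style formulation (cosine similarity followed by a softmax). The function name `warp_by_correspondence`, its parameters, and the `temperature` value are hypothetical choices made for illustration.

```python
import numpy as np


def warp_by_correspondence(target_feats, source_feats, source_values,
                           temperature=0.01):
    """Soft-warp source values onto target positions.

    Assumed shapes (illustrative, not taken from the paper):
      target_feats:  (Nt, C) one feature vector per target position
      source_feats:  (Ns, C) one feature vector per source position
      source_values: (Ns, D) what gets transferred, e.g. texture features
    Returns the warped values (Nt, D) and the correspondence weights (Nt, Ns).
    """
    def normalize(x):
        # Center each feature vector over channels, then scale to unit length,
        # so the dot product below is a cosine similarity.
        x = x - x.mean(axis=1, keepdims=True)
        return x / (np.linalg.norm(x, axis=1, keepdims=True) + 1e-8)

    t = normalize(np.asarray(target_feats, dtype=float))
    s = normalize(np.asarray(source_feats, dtype=float))

    # Dense correlation: similarity of every target position to every
    # source position.
    corr = t @ s.T

    # Row-wise softmax; a small temperature sharpens the matching.
    logits = corr / temperature
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Each target position receives a weighted average of source values.
    return weights @ np.asarray(source_values, dtype=float), weights


if __name__ == "__main__":
    # Toy check: target features are a shuffled copy of the source features,
    # so the warp should recover the shuffled source values.
    source = np.eye(4)
    order = [2, 0, 3, 1]
    values = np.arange(8, dtype=float).reshape(4, 2)
    warped, _ = warp_by_correspondence(source[order], source, values)
    print(np.allclose(warped, values[order], atol=1e-6))
```

Because every target position receives a weighted average over all source positions, the warp remains differentiable. This is one reason soft matching of this kind is often preferred over hard nearest-neighbour assignment when the correspondence is learned jointly with a generator.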

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Advanced Vision and Imaging