Two-Stream Appearance Transfer Network for Person Image Generation

Chengkang Shen; Peiyan Wang; Wei Tang

arXiv:2011.04181·cs.CV·November 10, 2020

Two-Stream Appearance Transfer Network for Person Image Generation

Chengkang Shen, Peiyan Wang, Wei Tang

PDF

Open Access

TL;DR

This paper introduces a two-stream appearance transfer network (2s-ATN) for pose-guided person image generation, effectively handling large deformations and occlusions to produce realistic images.

Contribution

The paper presents a novel multi-stage two-stream architecture with dense correspondence and feature fusion modules for improved pose-guided person image synthesis.

Findings

01

Outperforms previous methods on benchmark datasets.

02

Effectively manages large spatial deformations and occlusions.

03

Retains detailed appearance information in generated images.

Abstract

Pose guided person image generation means to generate a photo-realistic person image conditioned on an input person image and a desired pose. This task requires spatial manipulation of the source image according to the target pose. However, the generative adversarial networks (GANs) widely used for image generation and translation rely on spatially local and translation equivariant operators, i.e., convolution, pooling and unpooling, which cannot handle large image deformation. This paper introduces a novel two-stream appearance transfer network (2s-ATN) to address this challenge. It is a multi-stage architecture consisting of a source stream and a target stream. Each stage features an appearance transfer module and several two-stream feature fusion modules. The former finds the dense correspondence between the two-stream feature maps and then transfers the appearance information from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis