Cross Attention Based Style Distribution for Controllable Person Image Synthesis

Xinyue Zhou; Mingyu Yin; Xinyuan Chen; Li Sun; Changxin Gao; Qingli Li

arXiv:2208.00712·cs.CV·August 2, 2022·5 cites

TL;DR

This paper introduces a cross attention style distribution module for controllable person image synthesis, enabling explicit control over pose and appearance with improved style transfer accuracy.

Contribution

It proposes a novel cross attention based style distribution mechanism that effectively aligns source styles with target poses for enhanced image synthesis.

Findings

01. Improves pose transfer quality both quantitatively and qualitatively.

02. Effectively routes color and texture based on attention between source styles and target pose.

03. Validated on pose transfer and virtual try-on tasks.

Abstract

The controllable person image synthesis task enables a wide range of applications through explicit control over body pose and appearance. In this paper, we propose a cross attention based style distribution module that computes attention between the source semantic styles and the target pose for pose transfer. The module intentionally selects the style represented by each semantic and distributes these styles according to the target pose. The attention matrix in cross attention expresses the dynamic similarities between the target pose and the source styles for all semantics. It can therefore be used to route the color and texture from the source image, and it is further constrained by the target parsing map to achieve a clearer objective. In addition, to encode the source appearance accurately, self attention among the different semantic styles is added. The effectiveness of our model is validated…
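The mechanism the abstract describes can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (the official PyTorch repository is the reference for that). The tensor sizes, the random projection weights, the single-head scaled dot-product form, and the cross-entropy form of the parsing constraint are all assumptions made for illustration. The function names `attention` and `parsing_loss` are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(queries, keys_values, w_q, w_k, w_v):
    """Single-head scaled dot-product attention (assumed form).

    queries:     (N, C) features that ask for style
    keys_values: (K, C) style vectors, one per semantic region
    returns:     (N, D) attended features and the (N, K) attention matrix
    """
    q = queries @ w_q
    k = keys_values @ w_k
    v = keys_values @ w_v
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))  # each row sums to 1
    return attn @ v, attn

def parsing_loss(attn, target_parsing):
    # Assumed constraint: cross-entropy between each attention row and the
    # semantic label of that position in the target parsing map.
    picked = attn[np.arange(len(target_parsing)), target_parsing]
    return float(-np.log(picked + 1e-8).mean())

# Hypothetical sizes: 16 pose positions, 8 semantic regions, 32 channels.
rng = np.random.default_rng(0)
N, K, C, D = 16, 8, 32, 32
pose_feat = rng.normal(size=(N, C))      # target pose features
styles = rng.normal(size=(K, C))         # source per-semantic styles
target_parsing = rng.integers(0, K, size=N)
weights = [rng.normal(size=(C, D)) / np.sqrt(C) for _ in range(6)]

# Self attention among the source styles, then cross attention in which
# target pose positions query the refined styles.
refined, _ = attention(styles, styles, *weights[:3])
styled, attn = attention(pose_feat, refined, *weights[3:])
loss = parsing_loss(attn, target_parsing)
print(styled.shape, attn.shape)
```

Each row of `attn` is a distribution over the source semantic styles for one target pose position, which is what allows it to be compared against a parsing label at that position.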

Code & Models

Repositories

xyzhouo/casd (PyTorch, official)

Taxonomy

Topics: Face recognition and analysis · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis