Combining Attention with Flow for Person Image Synthesis

Yurui Ren; Yubo Wu; Thomas H. Li; Shan Liu; Ge Li

arXiv:2108.01823·cs.CV·August 5, 2021·1 cites

Combining Attention with Flow for Person Image Synthesis

Yurui Ren, Yubo Wu, Thomas H. Li, Shan Liu, Ge Li

PDF

Open Access

TL;DR

This paper introduces a novel person image synthesis model that combines attention and flow-based operations to improve the accuracy of target structures and realism of textures, demonstrating superior results and versatility.

Contribution

The paper proposes a new model that integrates attention with flow-based operations for enhanced person image synthesis, showing improved performance over existing methods.

Findings

01

Outperforms existing models in accuracy and realism

02

Demonstrates versatility in portrait image editing

03

Validated through extensive experiments and ablation studies

Abstract

Pose-guided person image synthesis aims to synthesize person images by transforming reference images into target poses. In this paper, we observe that the commonly used spatial transformation blocks have complementary advantages. We propose a novel model by combining the attention operation with the flow-based operation. Our model not only takes the advantage of the attention operation to generate accurate target structures but also uses the flow-based operation to sample realistic source textures. Both objective and subjective experiments demonstrate the superiority of our model. Meanwhile, comprehensive ablation studies verify our hypotheses and show the efficacy of the proposed modules. Besides, additional experiments on the portrait image editing task demonstrate the versatility of the proposed combination.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Face recognition and analysis