Cross-View Image Synthesis with Deformable Convolution and Attention   Mechanism

Hao Ding; Songsong Wu; Hao Tang; Fei Wu; Guangwei Gao; Xiao-Yuan; Jing

arXiv:2007.09858·cs.CV·July 21, 2020·1 cites

Cross-View Image Synthesis with Deformable Convolution and Attention Mechanism

Hao Ding, Songsong Wu, Hao Tang, Fei Wu, Guangwei Gao, Xiao-Yuan, Jing

PDF

Open Access

TL;DR

This paper introduces a GAN-based approach utilizing deformable convolution and attention mechanisms to improve cross-view image synthesis, effectively handling large view differences and occlusions to produce more realistic images.

Contribution

It proposes a novel GAN framework with deformable convolution and attention modules specifically designed for cross-view image synthesis, enhancing feature extraction and correspondence learning.

Findings

01

Outperforms state-of-the-art methods on Dayton dataset

02

Produces more realistic images with better scene understanding

03

Effective in handling occlusions and view differences

Abstract

Learning to generate natural scenes has always been a daunting task in computer vision. This is even more laborious when generating images with very different views. When the views are very different, the view fields have little overlap or objects are occluded, leading the task very challenging. In this paper, we propose to use Generative Adversarial Networks(GANs) based on a deformable convolution and attention mechanism to solve the problem of cross-view image synthesis (see Fig.1). It is difficult to understand and transform scenes appearance and semantic information from another view, thus we use deformed convolution in the U-net network to improve the network's ability to extract features of objects at different scales. Moreover, to better learn the correspondence between images from different views, we apply an attention mechanism to refine the intermediate feature map thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging

MethodsConcatenated Skip Connection · Deformable Convolution · Max Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · U-Net · Convolution