Distance Weighted Trans Network for Image Completion
Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Xuelong Li, and, Yue Lu

TL;DR
This paper introduces a novel image completion model combining Distance-based Weighted Transformer with CNNs and Residual Fast Fourier Convolution to better understand global structures and textures, outperforming existing methods.
Contribution
The paper proposes a new architecture that integrates DWT, CNNs, and Res-FFC blocks for improved global dependency modeling and texture synthesis in image completion.
Findings
Outperforms existing methods on three datasets
Effectively models global dependencies with DWT
Enhances texture synthesis with Res-FFC blocks
Abstract
The challenge of image generation has been effectively modeled as a problem of structure priors or transformation. However, existing models have unsatisfactory performance in understanding the global input image structures because of particular inherent features (for example, local inductive prior). Recent studies have shown that self-attention is an efficient modeling technique for image completion problems. In this paper, we propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components. In our model, we leverage the strengths of both Convolutional Neural Networks (CNNs) and DWT blocks to enhance the image completion process. Specifically, CNNs are used to augment the local texture information of coarse priors and DWT blocks are used to recover certain coarse textures and coherent visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Vision and Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Byte Pair Encoding · Linear Layer · Label Smoothing · Residual Connection · Adam · Absolute Position Encodings · Layer Normalization
