ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors
Chenjie Cao, Qiaole Dong, Yanwei Fu

TL;DR
ZITS++ introduces an advanced transformer-based framework that leverages structural priors and Fourier-based texture restoration to improve high-resolution image inpainting, effectively restoring both textures and structures.
Contribution
The paper proposes ZITS++, an improved inpainting model that combines structural priors, Fourier CNNs, and incremental transformer techniques for better high-resolution image restoration.
Findings
Enhanced inpainting quality with vivid textures and structures.
Improved stability and inpainting ability over previous methods.
Effective utilization of various image priors for high-resolution inpainting.
Abstract
Image inpainting involves filling missing areas of a corrupted image. Despite impressive results have been achieved recently, restoring images with both vivid textures and reasonable structures remains a significant challenge. Previous methods have primarily addressed regular textures while disregarding holistic structures due to the limited receptive fields of Convolutional Neural Networks (CNNs). To this end, we study learning a Zero-initialized residual addition based Incremental Transformer on Structural priors (ZITS++), an improved model upon our conference work, ZITS. Specifically, given one corrupt image, we present the Transformer Structure Restorer (TSR) module to restore holistic structural priors at low image resolution, which are further upsampled by Simple Structure Upsampler (SSU) module to higher image resolution. To recover image texture details, we use the Fourier CNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Adam
