Learning Prior Feature and Attention Enhanced Image Inpainting
Chenjie Cao, Qiaole Dong, Yanwei Fu

TL;DR
This paper introduces a novel image inpainting method that leverages Vision Transformers pre-trained with Masked AutoEncoder to incorporate richer priors and attention mechanisms, significantly improving inpainting quality.
Contribution
The paper proposes integrating MAE pre-training and attention priors from ViT into inpainting models, enhancing their ability to learn long-distance dependencies and richer priors.
Findings
Effective inpainting results on Places2 and FFHQ datasets
Outperforms traditional CNN-based inpainting methods
Demonstrates the benefit of ViT and MAE in image restoration
Abstract
Many recent inpainting works have achieved impressive results by leveraging Deep Neural Networks (DNNs) to model various prior information for image restoration. Unfortunately, the performance of these methods is largely limited by the representation ability of vanilla Convolutional Neural Networks (CNNs) backbones.On the other hand, Vision Transformers (ViT) with self-supervised pre-training have shown great potential for many visual recognition and object detection tasks. A natural question is whether the inpainting task can be greatly benefited from the ViT backbone? However, it is nontrivial to directly replace the new backbones in inpainting networks, as the inpainting is an inverse problem fundamentally different from the recognition tasks. To this end, this paper incorporates the pre-training based Masked AutoEncoder (MAE) into the inpainting model, which enjoys richer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Advanced Image Processing Techniques
MethodsMasked autoencoder · Inpainting
