Learning Prior Feature and Attention Enhanced Image Inpainting

Chenjie Cao; Qiaole Dong; Yanwei Fu

arXiv:2208.01837·cs.CV·August 29, 2023·1 cites

Learning Prior Feature and Attention Enhanced Image Inpainting

Chenjie Cao, Qiaole Dong, Yanwei Fu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel image inpainting method that leverages Vision Transformers pre-trained with Masked AutoEncoder to incorporate richer priors and attention mechanisms, significantly improving inpainting quality.

Contribution

The paper proposes integrating MAE pre-training and attention priors from ViT into inpainting models, enhancing their ability to learn long-distance dependencies and richer priors.

Findings

01

Effective inpainting results on Places2 and FFHQ datasets

02

Outperforms traditional CNN-based inpainting methods

03

Demonstrates the benefit of ViT and MAE in image restoration

Abstract

Many recent inpainting works have achieved impressive results by leveraging Deep Neural Networks (DNNs) to model various prior information for image restoration. Unfortunately, the performance of these methods is largely limited by the representation ability of vanilla Convolutional Neural Networks (CNNs) backbones.On the other hand, Vision Transformers (ViT) with self-supervised pre-training have shown great potential for many visual recognition and object detection tasks. A natural question is whether the inpainting task can be greatly benefited from the ViT backbone? However, it is nontrivial to directly replace the new backbones in inpainting networks, as the inpainting is an inverse problem fundamentally different from the recognition tasks. To this end, this paper incorporates the pre-training based Masked AutoEncoder (MAE) into the inpainting model, which enjoys richer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ewrfcas/MAE-FAR
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Advanced Image Processing Techniques

MethodsMasked autoencoder · Inpainting