Transformer-based Image and Video Inpainting: Current Challenges and Future Directions
Omar Elharrouss, Rafat Damseh, Abdelkader Nasreddine Belkacem, Elarbi, Badidi, Abderrahmane Lakas

TL;DR
This paper reviews transformer-based methods for image and video inpainting, highlighting recent advancements, challenges, and future research directions in leveraging self-attention mechanisms for global context understanding.
Contribution
It provides a comprehensive categorization and analysis of transformer-based inpainting techniques, offering insights and guidelines for future research in the field.
Findings
Transformers improve global context capture in inpainting tasks.
Recent methods show enhanced texture and structure reconstruction.
Challenges include computational complexity and handling diverse damage types.
Abstract
Image inpainting is currently a hot topic within the field of computer vision. It offers a viable solution for various applications, including photographic restoration, video editing, and medical imaging. Deep learning advancements, notably convolutional neural networks (CNNs) and generative adversarial networks (GANs), have significantly enhanced the inpainting task with an improved capability to fill missing or damaged regions in an image or video through the incorporation of contextually appropriate details. These advancements have improved other aspects, including efficiency, information preservation, and achieving both realistic textures and structures. Recently, visual transformers have been exploited and offer some improvements to image or video inpainting. The advent of transformer-based architectures, which were initially designed for natural language processing, has also been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image Processing Techniques · Digital Media Forensic Detection
MethodsFocus · Inpainting
