DiffuEraser: A Diffusion Model for Video Inpainting
Xiaowen Li, Haolan Xue, Peiran Ren, Liefeng Bo

TL;DR
DiffuEraser introduces a diffusion-based video inpainting model that enhances detail, structure, and temporal consistency in masked regions, outperforming existing methods especially with large masks.
Contribution
The paper presents DiffuEraser, a novel diffusion model for video inpainting that leverages prior information and expanded temporal receptive fields for improved results.
Findings
Outperforms state-of-the-art in content completeness
Achieves superior temporal consistency
Maintains acceptable efficiency
Abstract
Recent video inpainting algorithms integrate flow-based pixel propagation with transformer-based generation to leverage optical flow for restoring textures and objects using information from neighboring frames, while completing masked regions through visual Transformers. However, these approaches often encounter blurring and temporal inconsistencies when dealing with large masks, highlighting the need for models with enhanced generative capabilities. Recently, diffusion models have emerged as a prominent technique in image and video generation due to their impressive performance. In this paper, we introduce DiffuEraser, a video inpainting model based on stable diffusion, designed to fill masked regions with greater details and more coherent structures. We incorporate prior information to provide initialization and weak conditioning,which helps mitigate noisy artifacts and suppress…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Law in Society and Culture · Digital Media and Visual Art
MethodsDiffusion · Inpainting
