Elevating Flow-Guided Video Inpainting with Reference Generation
Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon-Young Lee

TL;DR
This paper presents a novel video inpainting framework that combines large generative models for reference content creation with an advanced pixel propagation technique, achieving high-quality results and supporting high-resolution videos.
Contribution
It introduces a new VI framework integrating generative reference generation and a one-shot pixel pulling method, along with a high-quality benchmark dataset for realistic evaluation.
Findings
Significantly improves frame-level quality for object removal.
Able to synthesize new content based on user prompts.
Supports high-resolution videos exceeding 2K.
Abstract
Video inpainting (VI) is a challenging task that requires effective propagation of observable content across frames while simultaneously generating new content not present in the original video. In this study, we propose a robust and practical VI framework that leverages a large generative model for reference generation in combination with an advanced pixel propagation algorithm. Powered by a strong generative model, our method not only significantly enhances frame-level quality for object removal but also synthesizes new content in the missing areas based on user-provided text prompts. For pixel propagation, we introduce a one-shot pixel pulling method that effectively avoids error accumulation from repeated sampling while maintaining sub-pixel precision. To evaluate various VI methods in realistic scenarios, we also propose a high-quality VI benchmark, HQVI, comprising carefully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Computer Graphics and Visualization Techniques
MethodsInpainting
