Semantically Consistent Video Inpainting with Conditional Diffusion   Models

Dylan Green; William Harvey; Saeid Naderiparizi; Matthew Niedoba,; Yunpeng Liu; Xiaoxuan Liang; Jonathan Lavington; Ke Zhang; Vasileios Lioutas,; Setareh Dabiri; Adam Scibior; Berend Zwartsenberg; Frank Wood

arXiv:2405.00251·cs.CV·October 10, 2024

Semantically Consistent Video Inpainting with Conditional Diffusion Models

Dylan Green, William Harvey, Saeid Naderiparizi, Matthew Niedoba,, Yunpeng Liu, Xiaoxuan Liang, Jonathan Lavington, Ke Zhang, Vasileios Lioutas,, Setareh Dabiri, Adam Scibior, Berend Zwartsenberg, Frank Wood

PDF

Open Access

TL;DR

This paper introduces a novel conditional diffusion model framework for video inpainting that effectively synthesizes new content with high semantic, spatial, and temporal consistency, surpassing traditional flow or attention-based methods.

Contribution

The paper proposes a new generative approach using conditional video diffusion models with specialized sampling and conditioning techniques for improved video inpainting.

Findings

01

Capable of generating diverse, high-quality inpaintings

02

Effectively synthesizes new content consistent with context

03

Outperforms traditional methods on benchmark tasks

Abstract

Current state-of-the-art methods for video inpainting typically rely on optical flow or attention-based approaches to inpaint masked regions by propagating visual information across frames. While such approaches have led to significant progress on standard benchmarks, they struggle with tasks that require the synthesis of novel content that is not present in other frames. In this paper, we reframe video inpainting as a conditional generative modeling problem and present a framework for solving such problems with conditional video diffusion models. We introduce inpainting-specific sampling schemes which capture crucial long-range dependencies in the context, and devise a novel method for conditioning on the known pixels in incomplete frames. We highlight the advantages of using a generative approach for this task, showing that our method is capable of generating diverse, high-quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Law in Society and Culture

MethodsInpainting · Diffusion