Generative Omnimatte: Learning to Decompose Video into Layers
Yao-Chih Lee, Erika Lu, Sarah Rumbley, Michal Geyer, Jia-Bin Huang,, Tali Dekel, Forrester Cole

TL;DR
This paper introduces a generative layered video decomposition framework that accurately separates objects and effects in videos without relying on static backgrounds or depth data, enabling high-quality editing and occlusion completion.
Contribution
It proposes a novel diffusion-based approach that does not assume scene stationarity or depth, allowing for effective decomposition and editing of complex, dynamic videos.
Findings
Produces clean, complete object layers with effects
Handles dynamic occlusions with convincing completions
Works on casually captured videos with complex effects
Abstract
Given a video and a set of input object masks, an omnimatte method aims to decompose the video into semantically meaningful layers containing individual objects along with their associated effects, such as shadows and reflections. Existing omnimatte methods assume a static background or accurate pose and depth estimation and produce poor decompositions when these assumptions are violated. Furthermore, due to the lack of generative prior on natural videos, existing methods cannot complete dynamic occluded regions. We present a novel generative layered video decomposition framework to address the omnimatte problem. Our method does not assume a stationary scene or require camera pose or depth information and produces clean, complete layers, including convincing completions of occluded dynamic regions. Our core idea is to train a video diffusion model to identify and remove scene effects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology · Educational Tools and Methods · Digital Games and Media
MethodsSparse Evolutionary Training · Diffusion · Inpainting
