Generative Omnimatte: Learning to Decompose Video into Layers

Yao-Chih Lee; Erika Lu; Sarah Rumbley; Michal Geyer; Jia-Bin Huang,; Tali Dekel; Forrester Cole

arXiv:2411.16683·cs.CV·March 25, 2025

Generative Omnimatte: Learning to Decompose Video into Layers

Yao-Chih Lee, Erika Lu, Sarah Rumbley, Michal Geyer, Jia-Bin Huang,, Tali Dekel, Forrester Cole

PDF

Open Access

TL;DR

This paper introduces a generative layered video decomposition framework that accurately separates objects and effects in videos without relying on static backgrounds or depth data, enabling high-quality editing and occlusion completion.

Contribution

It proposes a novel diffusion-based approach that does not assume scene stationarity or depth, allowing for effective decomposition and editing of complex, dynamic videos.

Findings

01

Produces clean, complete object layers with effects

02

Handles dynamic occlusions with convincing completions

03

Works on casually captured videos with complex effects

Abstract

Given a video and a set of input object masks, an omnimatte method aims to decompose the video into semantically meaningful layers containing individual objects along with their associated effects, such as shadows and reflections. Existing omnimatte methods assume a static background or accurate pose and depth estimation and produce poor decompositions when these assumptions are violated. Furthermore, due to the lack of generative prior on natural videos, existing methods cannot complete dynamic occluded regions. We present a novel generative layered video decomposition framework to address the omnimatte problem. Our method does not assume a stationary scene or require camera pose or depth information and produces clean, complete layers, including convincing completions of occluded dynamic regions. Our core idea is to train a video diffusion model to identify and remove scene effects…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimedia Communication and Technology · Educational Tools and Methods · Digital Games and Media

MethodsSparse Evolutionary Training · Diffusion · Inpainting