Generating Long Videos of Dynamic Scenes

Tim Brooks; Janne Hellsten; Miika Aittala; Ting-Chun Wang; Timo Aila,; Jaakko Lehtinen; Ming-Yu Liu; Alexei A. Efros; Tero Karras

arXiv:2206.03429·cs.CV·June 10, 2022·25 cites

Generating Long Videos of Dynamic Scenes

Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila,, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero Karras

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel video generation model that captures long-term dynamics, object persistence, and scene changes by redesigning temporal representations and training on longer videos, supported by new benchmarks.

Contribution

The paper proposes a new model with a two-phase training strategy and new benchmarks to improve long-term consistency and content variation in generated videos.

Findings

01

Effective long-term temporal consistency achieved

02

Model captures dynamic scene changes and object persistence

03

New benchmarks evaluate long-term video generation

Abstract

We present a video generation model that accurately reproduces object motion, changes in camera viewpoint, and new content that arises over time. Existing video generation methods often fail to produce new content as a function of time while maintaining consistencies expected in real environments, such as plausible dynamics and object persistence. A common failure case is for content to never change due to over-reliance on inductive biases to provide temporal consistency, such as a single latent code that dictates content for the entire video. On the other extreme, without long-term consistency, generated videos may morph unrealistically between different scenes. To address these limitations, we prioritize the time axis by redesigning the temporal latent representation and learning long-term consistency from data by training on longer videos. To this end, we leverage a two-phase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nvlabs/long-video-gan
pytorch

Videos

Generating Long Videos of Dynamic Scenes· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Video Analysis and Summarization