Stochastic Video Generation with a Learned Prior

Remi Denton; Rob Fergus

arXiv:1802.07687·cs.CV·March 14, 2024·304 cites

Stochastic Video Generation with a Learned Prior

Remi Denton, Rob Fergus

PDF

Open Access 3 Repos

TL;DR

This paper introduces an unsupervised stochastic video generation model that learns a prior of uncertainty, producing varied and sharp future frames that outperform existing methods.

Contribution

It presents a novel learned prior approach for stochastic video generation that captures uncertainty and improves the quality of long-term predictions.

Findings

01

Generated videos are more varied and sharper than previous methods.

02

The model is simple, end-to-end trainable, and effective across datasets.

03

Sample generations remain high quality even many frames into the future.

Abstract

Generating video frames that accurately predict future world states is challenging. Existing approaches either fail to capture the full distribution of outcomes, or yield blurry generations, or both. In this paper we introduce an unsupervised video generation model that learns a prior model of uncertainty in a given environment. Video frames are generated by drawing samples from this prior and combining them with a deterministic estimate of the future frame. The approach is simple and easily trained end-to-end on a variety of datasets. Sample generations are both varied and sharp, even many frames into the future, and compare favorably to those from existing approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis