Disentangling Video with Independent Prediction

William F. Whitney; Rob Fergus

arXiv:1901.05590·cs.LG·January 27, 2019·1 cites

Disentangling Video with Independent Prediction

William F. Whitney, Rob Fergus

PDF

Open Access

TL;DR

This paper introduces an unsupervised variational model that disentangles videos into independent, interpretable factors, enabling future prediction of each factor from its past without interference from others.

Contribution

The paper presents a novel unsupervised variational approach for video disentanglement that produces interpretable factors as objects in scenes.

Findings

01

Factors are often interpretable as scene objects.

02

The model successfully predicts future states of individual factors.

03

Disentanglement improves understanding of scene dynamics.

Abstract

We propose an unsupervised variational model for disentangling video into independent factors, i.e. each factor's future can be predicted from its past without considering the others. We show that our approach often learns factors which are interpretable as objects in a scene.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Chaos-based Image/Signal Encryption · Advanced Steganography and Watermarking Techniques