Brain-Inspired Inference on Missing Video Sequence

Weimian Li; Baoyang Chen; Wenmin Wang

arXiv:1912.06980·cs.CV·December 17, 2019

Brain-Inspired Inference on Missing Video Sequence

Weimian Li, Baoyang Chen, Wenmin Wang

PDF

Open Access

TL;DR

This paper introduces a novel end-to-end neural architecture inspired by human inference to generate diverse plausible intermediate video sequences from two static frames, trained with adversarial unsupervised learning.

Contribution

The work presents a new model that learns to generate multiple plausible video sequences between two frames using latent variables and adversarial training.

Findings

01

Capable of generating diverse intermediate videos

02

Imitates human inference in video prediction

03

Effective on Moving MNIST and 2D Shapes datasets

Abstract

In this paper, we propose a novel end-to-end architecture that could generate a variety of plausible video sequences correlating two given discontinuous frames. Our work is inspired by the human ability of inference. Specifically, given two static images, human are capable of inferring what might happen in between as well as present diverse versions of their inference. We firstly train our model to learn the transformation to understand the movement trends within given frames. For the sake of imitating the inference of human, we introduce a latent variable sampled from Gaussian distribution. By means of integrating different latent variables with learned transformation features, the model could learn more various possible motion modes. Then applying these motion modes on the original frame, we could acquire various corresponding intermediate video sequence. Moreover, the framework is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging