Folded Recurrent Neural Networks for Future Video Prediction

Marc Oliu; Javier Selva; Sergio Escalera

arXiv:1712.00311·cs.CV·March 19, 2018

Folded Recurrent Neural Networks for Future Video Prediction

Marc Oliu, Javier Selva, Sergio Escalera

PDF

1 Repo

TL;DR

This paper introduces bijective Gated Recurrent Units for future video prediction, reducing computational costs, mitigating error propagation, and improving prediction accuracy on multiple datasets.

Contribution

It proposes a novel bijective GRU architecture enabling efficient, explainable recurrent auto-encoders for video prediction, with reduced computational cost and enhanced performance.

Findings

01

Outperforms state-of-the-art on MMNIST and UCF101 datasets.

02

Achieves competitive results on KTH with less memory and computation.

03

Enables layer removal for model interpretability.

Abstract

Future video prediction is an ill-posed Computer Vision problem that recently received much attention. Its main challenges are the high variability in video content, the propagation of errors through time, and the non-specificity of the future frames: given a sequence of past frames there is a continuous distribution of possible futures. This work introduces bijective Gated Recurrent Units, a double mapping between the input and output of a GRU layer. This allows for recurrent auto-encoders with state sharing between encoder and decoder, stratifying the sequence representation and helping to prevent capacity problems. We show how with this topology only the encoder or decoder needs to be applied for input encoding and prediction, respectively. This reduces the computational cost and avoids re-encoding the predictions when generating a sequence of frames, mitigating the propagation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

moliusimon/frnn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGated Recurrent Unit