PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in   Spatiotemporal Predictive Learning

Yunbo Wang; Zhifeng Gao; Mingsheng Long; Jianmin Wang; Philip S. Yu

arXiv:1804.06300·cs.LG·November 20, 2018·315 cites

PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning

Yunbo Wang, Zhifeng Gao, Mingsheng Long, Jianmin Wang, Philip S. Yu

PDF

Open Access 5 Repos

TL;DR

PredRNN++ introduces a novel recurrent network with causal LSTM units and a gradient highway architecture, significantly improving spatiotemporal modeling and long-term dependency capture in video prediction tasks.

Contribution

The paper proposes a new recurrent unit called causal LSTM and a gradient highway architecture to enhance deep-in-time video predictive learning.

Findings

01

Achieves state-of-the-art results on synthetic and real datasets.

02

Effectively alleviates the vanishing gradient problem.

03

Handles complex occlusion scenarios successfully.

Abstract

We present PredRNN++, an improved recurrent network for video predictive learning. In pursuit of a greater spatiotemporal modeling capability, our approach increases the transition depth between adjacent states by leveraging a novel recurrent unit, which is named Causal LSTM for re-organizing the spatial and temporal memories in a cascaded mechanism. However, there is still a dilemma in video predictive learning: increasingly deep-in-time models have been designed for capturing complex variations, while introducing more difficulties in the gradient back-propagation. To alleviate this undesirable effect, we propose a Gradient Highway architecture, which provides alternative shorter routes for gradient flows from outputs back to long-range inputs. This architecture works seamlessly with causal LSTMs, enabling PredRNN++ to capture short-term and long-term dependencies adaptively. We assess…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory