Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All   Networks

Eder Santana; Matthew Emigh; Pablo Zegers; Jose C Principe

arXiv:1611.00050·cs.LG·March 16, 2017·2 cites

Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks

Eder Santana, Matthew Emigh, Pablo Zegers, Jose C Principe

PDF

Open Access

TL;DR

This paper introduces a convolutional recurrent neural network with Winner-Take-All dropout designed for high-dimensional unsupervised feature learning in multi-dimensional time series, demonstrating improved object recognition in videos.

Contribution

It presents a scalable reinterpretation of Deep Predictive Coding Networks, extends Winner-Take-All Autoencoders to temporal sequences, and introduces new initialization and regularization techniques for convolutional-recurrent networks.

Findings

01

Achieved better object recognition results in videos compared to existing methods.

02

Demonstrated effective unsupervised feature learning in high-dimensional time series.

03

Provided a novel end-to-end trainable architecture with improved regularization.

Abstract

We propose a convolutional recurrent neural network, with Winner-Take-All dropout for high dimensional unsupervised feature learning in multi-dimensional time series. We apply the proposedmethod for object recognition with temporal context in videos and obtain better results than comparable methods in the literature, including the Deep Predictive Coding Networks previously proposed by Chalasani and Principe.Our contributions can be summarized as a scalable reinterpretation of the Deep Predictive Coding Networks trained end-to-end with backpropagation through time, an extension of the previously proposed Winner-Take-All Autoencoders to sequences in time, and a new technique for initializing and regularizing convolutional-recurrent neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Advanced Vision and Imaging