Spatio-temporal video autoencoder with differentiable memory
Viorica Patraucean, Ankur Handa, Roberto Cipolla

TL;DR
This paper introduces a novel spatio-temporal video autoencoder with differentiable memory, combining a spatial autoencoder with a nested temporal autoencoder using convolutional LSTMs for motion modeling, trained end-to-end without supervision.
Contribution
It presents a new architecture integrating a differentiable visual memory with optical flow prediction for unsupervised motion feature learning in videos.
Findings
Effective motion estimation without supervision
Improved video frame prediction accuracy
Application to weakly-supervised semantic segmentation
Abstract
We describe a new spatio-temporal video autoencoder, based on a classic spatial image autoencoder and a novel nested temporal autoencoder. The temporal encoder is represented by a differentiable visual memory composed of convolutional long short-term memory (LSTM) cells that integrate changes over time. Here we target motion changes and use as temporal decoder a robust optical flow prediction module together with an image sampler serving as built-in feedback loop. The architecture is end-to-end differentiable. At each time step, the system receives as input a video frame, predicts the optical flow based on the current observation and the LSTM memory state as a dense transformation map, and applies it to the current frame to generate the next frame. By minimising the reconstruction error between the predicted next frame and the corresponding ground truth next frame, we train the whole…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Human Pose and Action Recognition
MethodsSigmoid Activation · Tanh Activation · Solana Customer Service Number +1-833-534-1729 · Long Short-Term Memory
