Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive   Architectures

Gaurav Mittal; Tanya Marwah; Vineeth N. Balasubramanian

arXiv:1611.10314·cs.CV·October 24, 2017

Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures

Gaurav Mittal, Tanya Marwah, Vineeth N. Balasubramanian

PDF

1 Repo

TL;DR

Sync-DRAW is a pioneering deep learning model that generates videos from text or noise by combining a variational autoencoder with a recurrent attention mechanism, capturing spatial and temporal dynamics effectively.

Contribution

It introduces the first approach for text-to-video generation using synchronized recurrent attention and VAE, advancing video synthesis capabilities.

Findings

01

Efficient learning of spatial and temporal video features

02

High structural integrity in generated frames

03

Successful video generation from simple captions

Abstract

This paper introduces a novel approach for generating videos called Synchronized Deep Recurrent Attentive Writer (Sync-DRAW). Sync-DRAW can also perform text-to-video generation which, to the best of our knowledge, makes it the first approach of its kind. It combines a Variational Autoencoder~(VAE) with a Recurrent Attention Mechanism in a novel manner to create a temporally dependent sequence of frames that are gradually formed over time. The recurrent attention mechanism in Sync-DRAW attends to each individual frame of the video in sychronization, while the VAE learns a latent distribution for the entire video at the global level. Our experiments with Bouncing MNIST, KTH and UCF-101 suggest that Sync-DRAW is efficient in learning the spatial and temporal information of the videos and generates frames with high structural integrity, and can generate videos from simple captions on these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Singularity42/Sync-DRAW
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsUSD Coin Customer Service Number +1-833-534-1729