Recurrent Deconvolutional Generative Adversarial Networks with   Application to Text Guided Video Generation

Hongyuan Yu; Yan Huang; Lihong Pi; Liang Wang

arXiv:2008.05856·cs.CV·August 14, 2020·1 cites

Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation

Hongyuan Yu, Yan Huang, Lihong Pi, Liang Wang

PDF

Open Access

TL;DR

This paper introduces RD-GAN, a recurrent deconvolutional GAN that effectively generates realistic videos conditioned on text descriptions, addressing frame discontinuity and leveraging long-range temporal dependencies.

Contribution

It presents a novel recurrent deconvolutional generator and a 3D CNN discriminator for improved text-guided video synthesis and related tasks.

Findings

01

Successfully generates realistic videos conditioned on text

02

Achieves superior performance in video prediction and classification

03

Addresses frame discontinuity issues in video generation

Abstract

This paper proposes a novel model for video generation and especially makes the attempt to deal with the problem of video generation from text descriptions, i.e., synthesizing realistic videos conditioned on given texts. Existing video generation methods cannot be easily adapted to handle this task well, due to the frame discontinuity issue and their text-free generation schemes. To address these problems, we propose a recurrent deconvolutional generative adversarial network (RD-GAN), which includes a recurrent deconvolutional network (RDN) as the generator and a 3D convolutional neural network (3D-CNN) as the discriminator. The RDN is a deconvolutional version of conventional recurrent neural network, which can well model the long-range temporal dependency of generated video frames and make good use of conditional information. The proposed model can be jointly trained by pushing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Advanced Image Processing Techniques