Temporal Generative Adversarial Nets with Singular Value Clipping

Masaki Saito; Eiichi Matsumoto; Shunta Saito

arXiv:1611.06624·cs.LG·August 21, 2017·35 cites

Temporal Generative Adversarial Nets with Singular Value Clipping

Masaki Saito, Eiichi Matsumoto, Shunta Saito

PDF

Open Access 4 Repos

TL;DR

This paper introduces Temporal Generative Adversarial Nets (TGAN), a novel video generation model that uses separate temporal and image generators, trained with Wasserstein GAN for stability, capable of learning semantic video representations.

Contribution

The paper presents a new TGAN architecture with separate generators for temporal and image features, and a stable training method using Wasserstein GAN, advancing video generation capabilities.

Findings

01

Effective video generation demonstrated in experiments

02

Stable end-to-end training achieved with Wasserstein GAN

03

Model captures semantic representations of unlabeled videos

Abstract

In this paper, we propose a generative model, Temporal Generative Adversarial Nets (TGAN), which can learn a semantic representation of unlabeled videos, and is capable of generating videos. Unlike existing Generative Adversarial Nets (GAN)-based methods that generate videos with a single generator consisting of 3D deconvolutional layers, our model exploits two different types of generators: a temporal generator and an image generator. The temporal generator takes a single latent variable as input and outputs a set of latent variables, each of which corresponds to an image frame in a video. The image generator transforms a set of such latent variables into a video. To deal with instability in training of GAN with such advanced networks, we adopt a recently proposed model, Wasserstein GAN, and propose a novel method to train it stably in an end-to-end manner. The experimental results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Kaiming Initialization · HuMan(Expedia)||How do I get a human at Expedia? · Linear Layer · Singular Value Clipping · RMSProp · TGAN · Convolution