Temporal Generative Adversarial Nets with Singular Value Clipping
Masaki Saito, Eiichi Matsumoto, Shunta Saito

TL;DR
This paper introduces Temporal Generative Adversarial Nets (TGAN), a novel video generation model that uses separate temporal and image generators, trained with Wasserstein GAN for stability, capable of learning semantic video representations.
Contribution
The paper presents a new TGAN architecture with separate generators for temporal and image features, and a stable training method using Wasserstein GAN, advancing video generation capabilities.
Findings
Effective video generation demonstrated in experiments
Stable end-to-end training achieved with Wasserstein GAN
Model captures semantic representations of unlabeled videos
Abstract
In this paper, we propose a generative model, Temporal Generative Adversarial Nets (TGAN), which can learn a semantic representation of unlabeled videos, and is capable of generating videos. Unlike existing Generative Adversarial Nets (GAN)-based methods that generate videos with a single generator consisting of 3D deconvolutional layers, our model exploits two different types of generators: a temporal generator and an image generator. The temporal generator takes a single latent variable as input and outputs a set of latent variables, each of which corresponds to an image frame in a video. The image generator transforms a set of such latent variables into a video. To deal with instability in training of GAN with such advanced networks, we adopt a recently proposed model, Wasserstein GAN, and propose a novel method to train it stably in an end-to-end manner. The experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Kaiming Initialization · HuMan(Expedia)||How do I get a human at Expedia? · Linear Layer · Singular Value Clipping · RMSProp · TGAN · Convolution
