Exploiting video sequences for unsupervised disentangling in generative   adversarial networks

Facundo Tuesca; Lucas C. Uzal

arXiv:1910.11104·cs.CV·October 25, 2019

Exploiting video sequences for unsupervised disentangling in generative adversarial networks

Facundo Tuesca, Lucas C. Uzal

PDF

Open Access

TL;DR

This paper introduces an unsupervised method for training GANs to learn a disentangled latent space from video sequences, separating content and motion attributes without supervision.

Contribution

It proposes a simple modification to standard GAN training that leverages video correlations to achieve disentanglement of content and motion in the latent space.

Findings

01

Successfully disentangles identity and motion attributes in face videos

02

Works on datasets with different subjects and movements

03

Provides both qualitative and quantitative evidence of disentanglement

Abstract

In this work we present an adversarial training algorithm that exploits correlations in video to learn --without supervision-- an image generator model with a disentangled latent space. The proposed methodology requires only a few modifications to the standard algorithm of Generative Adversarial Networks (GAN) and involves training with sets of frames taken from short videos. We train our model over two datasets of face-centered videos which present different people speaking or moving the head: VidTIMIT and YouTube Faces datasets. We found that our proposal allows us to split the generator latent space into two subspaces. One of them controls content attributes, those that do not change along short video sequences. For the considered datasets, this is the identity of the generated face. The other subspace controls motion attributes, those attributes that are observed to change along…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis