Dual Encoder-Decoder based Generative Adversarial Networks for   Disentangled Facial Representation Learning

Cong Hu; Zhen-Hua Feng; Xiao-Jun Wu; Josef Kittler

arXiv:1909.08797·cs.CV·September 20, 2019

Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning

Cong Hu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler

PDF

Open Access

TL;DR

This paper introduces DED-GAN, a novel dual encoder-decoder architecture for disentangled facial representation learning, improving pose-invariant recognition and face synthesis by integrating continuous pose estimation and Wasserstein loss.

Contribution

The paper proposes a dual encoder-decoder GAN framework that effectively disentangles pose and identity in facial images, with continuous pose modeling and improved training stability.

Findings

01

Outperforms state-of-the-art in pose-invariant face recognition

02

Achieves high-quality face synthesis across poses

03

Demonstrates robustness on diverse datasets

Abstract

To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and discriminator are designed with deep encoder-decoder architectures as their backbones. To be more specific, the encoder-decoder structured generator is used to learn a pose disentangled face representation, and the encoder-decoder structured discriminator is tasked to perform real/fake classification, face reconstruction, determining identity and estimating face pose. We further improve the proposed network architecture by minimising the additional pixel-wise loss defined by the Wasserstein distance at the output of the discriminator so that the adversarial framework can be better trained. Additionally, we consider face pose variation to be continuous, rather than discrete in existing literature, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition