Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning
Cong Hu, Zhen-Hua Feng, Xiao-Jun Wu, Josef Kittler

TL;DR
This paper introduces DED-GAN, a novel dual encoder-decoder architecture for disentangled facial representation learning, improving pose-invariant recognition and face synthesis by integrating continuous pose estimation and Wasserstein loss.
Contribution
The paper proposes a dual encoder-decoder GAN framework that effectively disentangles pose and identity in facial images, with continuous pose modeling and improved training stability.
Findings
Outperforms state-of-the-art in pose-invariant face recognition
Achieves high-quality face synthesis across poses
Demonstrates robustness on diverse datasets
Abstract
To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and discriminator are designed with deep encoder-decoder architectures as their backbones. To be more specific, the encoder-decoder structured generator is used to learn a pose disentangled face representation, and the encoder-decoder structured discriminator is tasked to perform real/fake classification, face reconstruction, determining identity and estimating face pose. We further improve the proposed network architecture by minimising the additional pixel-wise loss defined by the Wasserstein distance at the output of the discriminator so that the adversarial framework can be better trained. Additionally, we consider face pose variation to be continuous, rather than discrete in existing literature, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition
