Multimodal Face Synthesis from Visual Attributes

Xing Di; Vishal M. Patel

arXiv:2104.04362·cs.CV·January 14, 2022

Multimodal Face Synthesis from Visual Attributes

Xing Di, Vishal M. Patel

PDF

1 Repo

TL;DR

This paper introduces a novel GAN framework that synthesizes multimodal face images from visual attributes, preserving identity across modalities without needing paired training data, advancing face synthesis technology.

Contribution

A new GAN architecture with multimodal stretch-out and stretch-in modules enables identity-preserving multimodal face synthesis from attributes without paired data.

Findings

01

Effective multimodal face synthesis demonstrated

02

Outperforms state-of-the-art methods

03

Preserves identity across modalities

Abstract

Synthesis of face images from visual attributes is an important problem in computer vision and biometrics due to its applications in law enforcement and entertainment. Recent advances in deep generative networks have made it possible to synthesize high-quality face images from visual attributes. However, existing methods are specifically designed for generating unimodal images (i.e visible faces) from attributes. In this paper, we propose a novel generative adversarial network that simultaneously synthesizes identity preserving multimodal face images (i.e. visible, sketch, thermal, etc.) from visual attributes without requiring paired data in different domains for training the network. We introduce a novel generator with multimodal stretch-out modules to simultaneously synthesize multimodal face images. Additionally, multimodal stretch-in modules are introduced in the discriminator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Andribi/A2MF_AP
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.