Automatic Generation of Semantic Parts for Face Image Synthesis

Tomaso Fontanini; Claudio Ferrari; Massimo Bertozzi; Andrea Prati

arXiv:2307.05317·cs.CV·July 12, 2023

Automatic Generation of Semantic Parts for Face Image Synthesis

Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network architecture that automatically manipulates and generates semantic segmentation masks for face images, enabling automatic control over shape and texture in face image synthesis.

Contribution

The proposed model allows class-wise embedding and editing of masks, enabling automatic shape manipulation for face image synthesis, a feature previously limited to manual editing.

Findings

01

Model faithfully reconstructs and modifies masks.

02

Enables automatic shape manipulation in face synthesis.

03

Supports fully automatic control of shape and texture.

Abstract

Semantic image synthesis (SIS) refers to the problem of generating realistic imagery given a semantic segmentation mask that defines the spatial layout of object classes. Most of the approaches in the literature, other than the quality of the generated images, put effort in finding solutions to increase the generation diversity in terms of style i.e. texture. However, they all neglect a different feature, which is the possibility of manipulating the layout provided by the mask. Currently, the only way to do so is manually by means of graphical users interfaces. In this paper, we describe a network architecture to address the problem of automatically manipulating or generating the shape of object classes in semantic segmentation masks, with specific focus on human faces. Our proposed model allows embedding the mask class-wise into a latent space where each class embedding can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tfonta/semantic-vae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques

MethodsTanh Activation · Sigmoid Activation · Focus · Long Short-Term Memory