Automatic Generation of Semantic Parts for Face Image Synthesis
Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

TL;DR
This paper introduces a neural network architecture that automatically manipulates and generates semantic segmentation masks for face images, enabling automatic control over shape and texture in face image synthesis.
Contribution
The proposed model allows class-wise embedding and editing of masks, enabling automatic shape manipulation for face image synthesis, a feature previously limited to manual editing.
Findings
Model faithfully reconstructs and modifies masks.
Enables automatic shape manipulation in face synthesis.
Supports fully automatic control of shape and texture.
Abstract
Semantic image synthesis (SIS) refers to the problem of generating realistic imagery given a semantic segmentation mask that defines the spatial layout of object classes. Most of the approaches in the literature, other than the quality of the generated images, put effort in finding solutions to increase the generation diversity in terms of style i.e. texture. However, they all neglect a different feature, which is the possibility of manipulating the layout provided by the mask. Currently, the only way to do so is manually by means of graphical users interfaces. In this paper, we describe a network architecture to address the problem of automatically manipulating or generating the shape of object classes in semantic segmentation masks, with specific focus on human faces. Our proposed model allows embedding the mask class-wise into a latent space where each class embedding can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques
MethodsTanh Activation · Sigmoid Activation · Focus · Long Short-Term Memory
