Latents2Segments: Disentangling the Latent Space of Generative Models for Semantic Segmentation of Face Images
Snehal Singh Tomar, A.N. Rajagopalan

TL;DR
This paper introduces Latents2Segments, a generative model that disentangles facial semantic regions in latent space, enabling accurate, fast, and prior-free face segmentation for AR/VR applications.
Contribution
It proposes a novel latent space disentanglement approach for face segmentation that eliminates the need for priors and complex preprocessing, improving speed and comparable accuracy.
Findings
Higher disentanglement of semantic regions in latent space.
13% faster inference compared to SOTA.
Achieves comparable segmentation accuracy.
Abstract
With the advent of an increasing number of Augmented and Virtual Reality applications that aim to perform meaningful and controlled style edits on images of human faces, the impetus for the task of parsing face images to produce accurate and fine-grained semantic segmentation maps is more than ever before. Few State of the Art (SOTA) methods which solve this problem, do so by incorporating priors with respect to facial structure or other face attributes such as expression and pose in their deep classifier architecture. Our endeavour in this work is to do away with the priors and complex pre-processing operations required by SOTA multi-class face segmentation models by reframing this operation as a downstream task post infusion of disentanglement with respect to facial semantic regions of interest (ROIs) in the latent space of a Generative Autoencoder model. We present results for our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · AI in cancer detection
