A Unified Architecture of Semantic Segmentation and Hierarchical   Generative Adversarial Networks for Expression Manipulation

Rumeysa Bodur; Binod Bhattarai; Tae-Kyun Kim

arXiv:2112.04603·cs.CV·December 10, 2021

A Unified Architecture of Semantic Segmentation and Hierarchical Generative Adversarial Networks for Expression Manipulation

Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim

PDF

Open Access

TL;DR

This paper introduces an end-to-end differentiable framework combining semantic segmentation and hierarchical GANs for facial expression manipulation, improving quality and accuracy by exploiting semantic regions.

Contribution

The authors propose a unified architecture that integrates semantic segmentation with hierarchical GANs, enabling end-to-end training and better expression editing quality.

Findings

01

Outperforms state-of-the-art methods on AffectNet and RaFD benchmarks.

02

Achieves superior semantic segmentation accuracy on CelebAMask-HQ.

03

Produces high-quality, realistic expression manipulations with fewer artifacts.

Abstract

Editing facial expressions by only changing what we want is a long-standing research problem in Generative Adversarial Networks (GANs) for image manipulation. Most of the existing methods that rely only on a global generator usually suffer from changing unwanted attributes along with the target attributes. Recently, hierarchical networks that consist of both a global network dealing with the whole image and multiple local networks focusing on local parts are showing success. However, these methods extract local regions by bounding boxes centred around the sparse facial key points which are non-differentiable, inaccurate and unrealistic. Hence, the solution becomes sub-optimal, introduces unwanted artefacts degrading the overall quality of the synthetic images. Moreover, a recent study has shown strong correlation between facial attributes and local semantic regions. To exploit this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFacial Nerve Paralysis Treatment and Research · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis