A Unified Architecture of Semantic Segmentation and Hierarchical Generative Adversarial Networks for Expression Manipulation
Rumeysa Bodur, Binod Bhattarai, Tae-Kyun Kim

TL;DR
This paper introduces an end-to-end differentiable framework combining semantic segmentation and hierarchical GANs for facial expression manipulation, improving quality and accuracy by exploiting semantic regions.
Contribution
The authors propose a unified architecture that integrates semantic segmentation with hierarchical GANs, enabling end-to-end training and better expression editing quality.
Findings
Outperforms state-of-the-art methods on AffectNet and RaFD benchmarks.
Achieves superior semantic segmentation accuracy on CelebAMask-HQ.
Produces high-quality, realistic expression manipulations with fewer artifacts.
Abstract
Editing facial expressions by only changing what we want is a long-standing research problem in Generative Adversarial Networks (GANs) for image manipulation. Most of the existing methods that rely only on a global generator usually suffer from changing unwanted attributes along with the target attributes. Recently, hierarchical networks that consist of both a global network dealing with the whole image and multiple local networks focusing on local parts are showing success. However, these methods extract local regions by bounding boxes centred around the sparse facial key points which are non-differentiable, inaccurate and unrealistic. Hence, the solution becomes sub-optimal, introduces unwanted artefacts degrading the overall quality of the synthetic images. Moreover, a recent study has shown strong correlation between facial attributes and local semantic regions. To exploit this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacial Nerve Paralysis Treatment and Research · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis
