SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
Zhaoyang Sun, Yaxiong Chen, Shengwu Xiong

TL;DR
The paper introduces SSAT, a transformer network that achieves both makeup transfer and removal by learning semantic correspondences, resulting in more accurate and robust makeup editing across images and videos.
Contribution
A novel symmetric semantic-aware transformer network with a semantic correspondence module for simultaneous makeup transfer and removal.
Findings
Outperforms existing methods in visual accuracy
Effective in handling pose, expression, and occlusion variations
Successfully extended to video makeup transfer
Abstract
Makeup transfer is not only to extract the makeup style of the reference image, but also to render the makeup style to the semantic corresponding position of the target image. However, most existing methods focus on the former and ignore the latter, resulting in a failure to achieve desired results. To solve the above problems, we propose a unified Symmetric Semantic-Aware Transformer (SSAT) network, which incorporates semantic correspondence learning to realize makeup transfer and removal simultaneously. In SSAT, a novel Symmetric Semantic Corresponding Feature Transfer (SSCFT) module and a weakly supervised semantic loss are proposed to model and facilitate the establishment of accurate semantic correspondence. In the generation process, the extracted makeup features are spatially distorted by SSCFT to achieve semantic alignment with the target image, then the distorted makeup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques
MethodsAttention Is All You Need · Linear Layer · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization · Byte Pair Encoding · Label Smoothing · Multi-Head Attention · Absolute Position Encodings · Softmax
