FlowFace++: Explicit Semantic Flow-supervised End-to-End Face Swapping
Yu Zhang, Hao Zeng, Bowen Ma, Wei Zhang, Zhimeng Zhang, Yu Ding,, Tangjie Lv, Changjie Fan

TL;DR
FlowFace++ introduces an end-to-end face swapping framework that uses explicit semantic flow supervision and a shape-aware discriminator to produce highly realistic swapped faces, especially under challenging conditions.
Contribution
The paper presents a novel face swapping method combining semantic flow supervision, a shape-aware discriminator, and a fine-grained face representation to improve realism and robustness.
Findings
Outperforms state-of-the-art methods in various scenarios.
Effectively handles obstructed or uneven lighting source faces.
Achieves high-quality, shape-aware face swapping results.
Abstract
This work proposes a novel face-swapping framework FlowFace++, utilizing explicit semantic flow supervision and end-to-end architecture to facilitate shape-aware face-swapping. Specifically, our work pretrains a facial shape discriminator to supervise the face swapping network. The discriminator is shape-aware and relies on a semantic flow-guided operation to explicitly calculate the shape discrepancies between the target and source faces, thus optimizing the face swapping network to generate highly realistic results. The face swapping network is a stack of a pre-trained face-masked autoencoder (MAE), a cross-attention fusion module, and a convolutional decoder. The MAE provides a fine-grained facial image representation space, which is unified for the target and source faces and thus facilitates final realistic results. The cross-attention fusion module carries out the source-to-target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection
MethodsMasked autoencoder
