AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose
Jongmin Yu, Hyeontaek Oh, Zhongtian Sun, Angelica I Aviles-Rivero, Moongu Jeon, and Jinhong Yang

TL;DR
AlphaFace is a real-time face swapping method that uses vision-language models and contrastive losses to improve identity preservation and attribute accuracy, especially in extreme facial poses.
Contribution
It introduces a novel approach combining vision-language models with contrastive learning to enhance face swapping robustness and real-time performance.
Findings
Outperforms state-of-the-art in pose-challenging scenarios
Maintains real-time processing speeds
Improves identity and attribute preservation
Abstract
Existing face-swapping methods often deliver competitive results in constrained settings but exhibit substantial quality degradation when handling extreme facial poses. To improve facial pose robustness, explicit geometric features are applied, but this approach remains problematic since it introduces additional dependencies and increases computational cost. Diffusion-based methods have achieved remarkable results; however, they are impractical for real-time processing. We introduce AlphaFace, which leverages an open-source vision-language model and CLIP image and text embeddings to apply novel visual and textual semantic contrastive losses. AlphaFace enables stronger identity representation and more precise attribute preservation, all while maintaining real-time performance. Comprehensive experiments across FF++, MPIE, and LPFF demonstrate that AlphaFace surpasses state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception
