AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose

Jongmin Yu; Hyeontaek Oh; Zhongtian Sun; Angelica I Aviles-Rivero; Moongu Jeon; and Jinhong Yang

arXiv:2601.16429·cs.CV·January 26, 2026

AlphaFace: High Fidelity and Real-time Face Swapper Robust to Facial Pose

Jongmin Yu, Hyeontaek Oh, Zhongtian Sun, Angelica I Aviles-Rivero, Moongu Jeon, and Jinhong Yang

PDF

Open Access

TL;DR

AlphaFace is a real-time face swapping method that uses vision-language models and contrastive losses to improve identity preservation and attribute accuracy, especially in extreme facial poses.

Contribution

It introduces a novel approach combining vision-language models with contrastive learning to enhance face swapping robustness and real-time performance.

Findings

01

Outperforms state-of-the-art in pose-challenging scenarios

02

Maintains real-time processing speeds

03

Improves identity and attribute preservation

Abstract

Existing face-swapping methods often deliver competitive results in constrained settings but exhibit substantial quality degradation when handling extreme facial poses. To improve facial pose robustness, explicit geometric features are applied, but this approach remains problematic since it introduces additional dependencies and increases computational cost. Diffusion-based methods have achieved remarkable results; however, they are impractical for real-time processing. We introduce AlphaFace, which leverages an open-source vision-language model and CLIP image and text embeddings to apply novel visual and textual semantic contrastive losses. AlphaFace enables stronger identity representation and more precise attribute preservation, all while maintaining real-time performance. Comprehensive experiments across FF++, MPIE, and LPFF demonstrate that AlphaFace surpasses state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception