Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR
Fatemeh Ghorbani Lohesara, Karen Eguiazarian, Sebastian Knorr

TL;DR
This paper presents a geometry-aware, learning-based framework that jointly removes head-mounted display occlusions and reconstructs accurate 3D facial geometry from RGB videos, enhancing social XR experiences.
Contribution
It introduces a novel integrated approach combining GAN-based inpainting and 3D face reconstruction guided by dense landmarks and a reference frame.
Findings
Successfully removes HMD occlusions from RGB videos
Produces photorealistic 3D face geometry reconstructions
Robust across varying landmark densities
Abstract
Head-mounted displays (HMDs) are essential for experiencing extended reality (XR) environments and observing virtual content. However, they obscure the upper part of the user's face, complicating external video recording and significantly impacting social XR applications such as teleconferencing, where facial expressions and eye gaze details are crucial for creating an immersive experience. This study introduces a geometry-aware learning-based framework to jointly remove HMD occlusions and reconstruct complete 3D facial geometry from RGB frames captured from a single viewpoint. The method integrates a GAN-based video inpainting network, guided by dense facial landmarks and a single occlusion-free reference frame, to restore missing facial regions while preserving identity. Subsequently, a SynergyNet-based module regresses 3D Morphable Model (3DMM) parameters from the inpainted frames,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
