Facial Depth and Normal Estimation using Single Dual-Pixel Camera
Minjun Kang, Jaesung Choe, Hyowon Ha, Hae-Gon Jeon, Sunghoon Im, In So, Kweon, KuK-Jin Yoon

TL;DR
This paper introduces a novel deep learning approach for 3D facial depth and normal estimation using dual-pixel camera data, supported by a large dataset and specialized modules to handle defocus blur, achieving state-of-the-art results.
Contribution
The paper presents a new DP-oriented network with adaptive modules and a large dataset for accurate 3D facial geometry reconstruction from dual-pixel images.
Findings
Achieved state-of-the-art performance in DP-based depth/normal estimation.
Demonstrated applications in face spoofing detection and relighting.
Collected a large dataset of 135K images with ground-truth 3D models.
Abstract
Many mobile manufacturers recently have adopted Dual-Pixel (DP) sensors in their flagship models for faster auto-focus and aesthetic image captures. Despite their advantages, research on their usage for 3D facial understanding has been limited due to the lack of datasets and algorithmic designs that exploit parallax in DP images. This is because the baseline of sub-aperture images is extremely narrow and parallax exists in the defocus blur region. In this paper, we introduce a DP-oriented Depth/Normal network that reconstructs the 3D facial geometry. For this purpose, we collect a DP facial data with more than 135K images for 101 persons captured with our multi-camera structured light systems. It contains the corresponding ground-truth 3D models including depth map and surface normal in metric scale. Our dataset allows the proposed matching network to be generalized for 3D facial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Advanced Image Processing Techniques
