WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering
Yuxuan Han, Xin Ming, Tianxiao Li, Zhuofan Shen, Qixuan Zhang, Lan Xu, Feng Xu

TL;DR
WildCap introduces a hybrid inverse rendering approach that enables high-quality facial albedo capture from smartphone videos in natural settings, overcoming lighting complexities and local artifacts to match controlled environment quality.
Contribution
The paper presents a novel hybrid inverse rendering framework with a texel grid lighting model and diffusion prior optimization, advancing in-the-wild facial albedo capture technology.
Findings
Significantly improves in-the-wild facial albedo quality.
Reduces artifacts like shadow-baking in albedo predictions.
Closes the quality gap between in-the-wild and controlled captures.
Abstract
Existing methods achieve high-quality facial albedo capture under controllable lighting, which increases capture cost and limits usability. We propose WildCap, a novel method for high-quality facial albedo capture from a smartphone video recorded in the wild. To disentangle high-quality albedo from complex lighting effects in in-the-wild captures, we propose a novel hybrid inverse rendering framework. We first apply a data-driven method, i.e., SwitchLight, to convert the captured images into more constrained conditions and then adopt model-based inverse rendering. However, unavoidable local artifacts in network predictions, such as shadow-baking, are non-physical and thus hinder accurate inverse rendering of lighting and material. To address this, we propose a novel texel grid lighting model to explain non-physical effects as clean albedo illuminated by local physical lighting. During…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Facial Rejuvenation and Surgery Techniques
