WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering

Yuxuan Han; Xin Ming; Tianxiao Li; Zhuofan Shen; Qixuan Zhang; Lan Xu; Feng Xu

arXiv:2512.11237·cs.CV·March 18, 2026

WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering

Yuxuan Han, Xin Ming, Tianxiao Li, Zhuofan Shen, Qixuan Zhang, Lan Xu, Feng Xu

PDF

Open Access

TL;DR

WildCap introduces a hybrid inverse rendering approach that enables high-quality facial albedo capture from smartphone videos in natural settings, overcoming lighting complexities and local artifacts to match controlled environment quality.

Contribution

The paper presents a novel hybrid inverse rendering framework with a texel grid lighting model and diffusion prior optimization, advancing in-the-wild facial albedo capture technology.

Findings

01

Significantly improves in-the-wild facial albedo quality.

02

Reduces artifacts like shadow-baking in albedo predictions.

03

Closes the quality gap between in-the-wild and controlled captures.

Abstract

Existing methods achieve high-quality facial albedo capture under controllable lighting, which increases capture cost and limits usability. We propose WildCap, a novel method for high-quality facial albedo capture from a smartphone video recorded in the wild. To disentangle high-quality albedo from complex lighting effects in in-the-wild captures, we propose a novel hybrid inverse rendering framework. We first apply a data-driven method, i.e., SwitchLight, to convert the captured images into more constrained conditions and then adopt model-based inverse rendering. However, unavoidable local artifacts in network predictions, such as shadow-baking, are non-physical and thus hinder accurate inverse rendering of lighting and material. To address this, we propose a novel texel grid lighting model to explain non-physical effects as clean albedo illuminated by local physical lighting. During…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Facial Rejuvenation and Surgery Techniques