Surfel-based Gaussian Inverse Rendering for Fast and Relightable Dynamic Human Reconstruction from Monocular Video

Yiqun Zhao; Chenming Wu; Binbin Huang; Yihao Zhi; Chen Zhao; Jingdong Wang; Shenghua Gao

arXiv:2407.15212 · cs.CV · October 10, 2025

Open Access

TL;DR

SGIA is a fast surfel-based method for reconstructing relightable, dynamic human avatars from monocular video. The reconstructed avatars can be re-posed and rendered under new lighting.

Contribution

The paper presents SGIA (Surfel-based Gaussian Inverse Avatar), a surfel-based Gaussian inverse rendering approach that models Physically-Based Rendering (PBR) properties for dynamic clothed humans, improving the speed and accuracy of relighting and geometry reconstruction.

Findings

01 Achieves highly accurate physical property estimation.

02 Enables realistic relighting of dynamic avatars.

03 Significantly faster rendering compared to implicit methods.

Abstract

Efficient and accurate reconstruction of a relightable, dynamic clothed human avatar from a monocular video is crucial for the entertainment industry. This paper presents SGIA (Surfel-based Gaussian Inverse Avatar), which introduces efficient training and rendering for relightable dynamic human reconstruction. SGIA advances previous Gaussian Avatar methods by comprehensively modeling Physically-Based Rendering (PBR) properties for clothed human avatars, allowing for the manipulation of avatars into novel poses under diverse lighting conditions. Specifically, our approach integrates pre-integration and image-based lighting for fast light calculations that surpass the performance of existing implicit-based techniques. To address challenges related to material lighting disentanglement and accurate geometry reconstruction, we propose an innovative occlusion approximation strategy and a…
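The abstract mentions pre-integration and image-based lighting without giving details here. As a rough illustration only, the sketch below shows one common form of that idea, the split-sum approximation, in which the diffuse irradiance, the pre-filtered specular radiance, and a BRDF scale/bias pair are all looked up from tables computed ahead of time. This is an assumption about the general technique, not the authors' implementation; the function name `shade_preintegrated`, its parameters, the 0.04 dielectric reflectance, and the convention that `irradiance` is already divided by π are all hypothetical choices made for the example.

```python
import numpy as np


def shade_preintegrated(albedo, metallic, irradiance, prefiltered,
                        brdf_scale, brdf_bias):
    """Combine pre-integrated lighting terms for a single surface point.

    All lookups are assumed to have been done already (hypothetical setup):
      irradiance  - RGB diffuse lighting pre-integrated around the normal,
                    assumed to be already divided by pi
      prefiltered - RGB environment radiance pre-filtered for the reflection
                    direction at the point's roughness
      brdf_scale, brdf_bias - scalars from a pre-integrated BRDF table
                    indexed by (n.v, roughness)
    """
    albedo = np.asarray(albedo, dtype=float)
    irradiance = np.asarray(irradiance, dtype=float)
    prefiltered = np.asarray(prefiltered, dtype=float)

    # Base reflectance: 0.04 for dielectrics (assumed), albedo for metals.
    f0 = 0.04 * (1.0 - metallic) + albedo * metallic

    # Diffuse term: metals have no diffuse response.
    diffuse = (1.0 - metallic) * albedo * irradiance

    # Split-sum specular term: pre-filtered light times pre-integrated BRDF.
    specular = prefiltered * (f0 * brdf_scale + brdf_bias)

    return diffuse + specular


# Example: a fully diffuse grey point lit only by diffuse irradiance.
rgb = shade_preintegrated(
    albedo=[0.5, 0.5, 0.5], metallic=0.0,
    irradiance=[1.0, 1.0, 1.0], prefiltered=[0.0, 0.0, 0.0],
    brdf_scale=0.5, brdf_bias=0.25,
)
```

Because every lighting integral is replaced by a table lookup, the per-point cost is a handful of multiplies and adds, which is the general reason this family of methods is faster than sampling the light integral at render time.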

Taxonomy

Topics: Human Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings