Interactive Rendering of Relightable and Animatable Gaussian Avatars
Youyi Zhan, Tianjia Shao, He Wang, Yin Yang, Kun Zhou

TL;DR
This paper introduces a fast, interactive method for rendering relightable and animatable 3D avatars from monocular videos using Gaussian Splatting, enabling real-time editing of viewpoint, pose, and lighting.
Contribution
The authors propose a novel Gaussian Splatting-based approach that decouples materials and lighting, achieving efficient, high-quality avatar rendering at interactive frame rates.
Findings
Achieves 6.9 fps rendering speed for avatars.
Produces higher quality results than previous methods.
Supports real-time editing of pose and lighting.
Abstract
Creating relightable and animatable avatars from multi-view or monocular videos is a challenging task for digital human creation and virtual reality applications. Previous methods rely on neural radiance fields or ray tracing, resulting in slow training and rendering processes. By utilizing Gaussian Splatting, we propose a simple and efficient method to decouple body materials and lighting from sparse-view or monocular avatar videos, so that the avatar can be rendered simultaneously under novel viewpoints, poses, and lightings at interactive frame rates (6.9 fps). Specifically, we first obtain the canonical body mesh using a signed distance function and assign attributes to each mesh vertex. The Gaussians in the canonical space then interpolate from nearby body mesh vertices to obtain the attributes. We subsequently deform the Gaussians to the posed space using forward skinning, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Educational Games and Gamification · Computer Graphics and Visualization Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
