StreamME: Simplify 3D Gaussian Avatar within Live Stream
Luchuan Song, Yang Zhou, Zhan Xu, Yi Zhou, Deepali Aneja, Chenliang Xu

TL;DR
StreamME introduces a rapid, on-the-fly 3D avatar reconstruction method from live videos using 3D Gaussian Splatting, enabling real-time applications while preserving privacy and reducing bandwidth.
Contribution
The paper presents a novel fast training strategy for 3D avatar reconstruction that eliminates MLPs and simplifies point cloud distribution for efficiency.
Findings
Achieves real-time 3D avatar reconstruction from live video.
Reduces training time significantly compared to traditional methods.
Supports downstream applications like animation and relighting.
Abstract
We propose StreamME, a method focuses on fast 3D avatar reconstruction. The StreamME synchronously records and reconstructs a head avatar from live video streams without any pre-cached data, enabling seamless integration of the reconstructed appearance into downstream applications. This exceptionally fast training strategy, which we refer to as on-the-fly training, is central to our approach. Our method is built upon 3D Gaussian Splatting (3DGS), eliminating the reliance on MLPs in deformable 3DGS and relying solely on geometry, which significantly improves the adaptation speed to facial expression. To further ensure high efficiency in on-the-fly training, we introduced a simplification strategy based on primary points, which distributes the point clouds more sparsely across the facial surface, optimizing points number while maintaining rendering quality. Leveraging the on-the-fly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques
