WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections
Yuze Wang, Junyi Wang, Yue Qi

TL;DR
This paper introduces WE-GS, an efficient 3D Gaussian-based framework for novel view synthesis from unconstrained photo collections, featuring a residual spherical harmonic transfer module and a spatial attention mechanism for improved rendering quality and speed.
Contribution
It extends 3D Gaussian Splatting with a residual spherical harmonic transfer and a plug-and-play spatial attention module, enabling better adaptation to lighting variations and occlusions in unconstrained photos.
Findings
Outperforms existing methods in rendering quality and speed.
Effectively adapts to lighting conditions and occlusions.
Achieves high convergence and real-time rendering.
Abstract
Novel View Synthesis (NVS) from unconstrained photo collections is challenging in computer graphics. Recently, 3D Gaussian Splatting (3DGS) has shown promise for photorealistic and real-time NVS of static scenes. Building on 3DGS, we propose an efficient point-based differentiable rendering framework for scene reconstruction from photo collections. Our key innovation is a residual-based spherical harmonic coefficients transfer module that adapts 3DGS to varying lighting conditions and photometric post-processing. This lightweight module can be pre-computed and ensures efficient gradient propagation from rendered images to 3D Gaussian attributes. Additionally, we observe that the appearance encoder and the transient mask predictor, the two most critical parts of NVS from unconstrained photo collections, can be mutually beneficial. We introduce a plug-and-play lightweight spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Max Pooling · Average Pooling · Sigmoid Activation
