LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
Jiamin Wu, Kenkun Liu, Han Gao, Xiaoke Jiang, Yao Yuan, Lei Zhang

TL;DR
LeanGaussian introduces a novel 3D Gaussian modeling approach that abandons pixel or point cloud correspondence constraints, leading to more accurate geometry, textures, and efficient rendering in novel view synthesis.
Contribution
It proposes breaking the pixel or point cloud correspondence in Gaussian modeling using deformable Transformers, improving 3D reconstruction and rendering performance.
Findings
Outperforms prior methods by ~6.1% in PSNR on ShapeNet and Google Scanned Objects datasets.
Achieves 7.2 FPS 3D reconstruction and 500 FPS rendering speeds.
Demonstrates improved geometry and texture quality in novel view synthesis.
Abstract
Recently, Gaussian splatting has demonstrated significant success in novel view synthesis. Current methods often regress Gaussians with pixel or point cloud correspondence, linking each Gaussian with a pixel or a 3D point. This leads to the redundancy of Gaussians being used to overfit the correspondence rather than the objects represented by the 3D Gaussians themselves, consequently wasting resources and lacking accurate geometries or textures. In this paper, we introduce LeanGaussian, a novel approach that treats each query in deformable Transformer as one 3D Gaussian ellipsoid, breaking the pixel or point cloud correspondence constraints. We leverage deformable decoder to iteratively refine the Gaussians layer-by-layer with the image features as keys and values. Notably, the center of each 3D Gaussian is defined as 3D reference points, which are then projected onto the image for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurface Roughness and Optical Measurements · Industrial Vision Systems and Defect Detection · Advanced X-ray and CT Imaging
MethodsByte Pair Encoding · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Adam · Attention Is All You Need · Softmax · Label Smoothing · Dropout · Linear Layer
