GaussRender: Learning 3D Occupancy with Gaussian Rendering
Lo\"ick Chambon, Eloi Zablocki, Alexandre Boulch, Micka\"el Chen, Matthieu Cord

TL;DR
GaussRender introduces a projective consistency loss using differentiable Gaussian splatting to improve 3D occupancy predictions in autonomous driving scenes, resulting in more accurate and coherent 3D reconstructions.
Contribution
The paper presents GaussRender, a novel module that enforces projective consistency via differentiable rendering, enhancing 3D occupancy learning without additional inference costs.
Findings
Significant improvements in geometric fidelity on multiple benchmarks.
State-of-the-art results on surface-sensitive metrics like RayIoU.
Effective integration with existing 3D occupancy architectures.
Abstract
Understanding the 3D geometry and semantics of driving scenes is critical for safe autonomous driving. Recent advances in 3D occupancy prediction have improved scene representation but often suffer from visual inconsistencies, leading to floating artifacts and poor surface localization. Existing voxel-wise losses (e.g., cross-entropy) fail to enforce visible geometric coherence. In this paper, we propose GaussRender, a module that improves 3D occupancy learning by enforcing projective consistency. Our key idea is to project both predicted and ground-truth 3D occupancy into 2D camera views, where we apply supervision. Our method penalizes 3D configurations that produce inconsistent 2D projections, thereby enforcing a more coherent 3D structure. To achieve this efficiently, we leverage differentiable rendering with Gaussian splatting. GaussRender seamlessly integrates with existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Pose and Action Recognition · Computer Graphics and Visualization Techniques
