GaussianOcc3D: A Gaussian-Based Adaptive Multi-modal 3D Occupancy Prediction
A. Enes Doruk, Hasan F. Ates

TL;DR
GaussianOcc3D introduces a novel multi-modal 3D occupancy prediction framework using Gaussian representations, effectively combining camera and LiDAR data for improved accuracy and robustness in autonomous driving environments.
Contribution
The paper proposes GaussianOcc3D, a memory-efficient, continuous Gaussian-based multi-modal framework with four innovative modules for enhanced 3D occupancy prediction.
Findings
Achieves state-of-the-art mIoU scores on multiple benchmarks.
Demonstrates robustness in adverse weather and lighting conditions.
Effectively fuses camera and LiDAR data with uncertainty-aware reweighting.
Abstract
3D semantic occupancy prediction is a pivotal task in autonomous driving, providing a dense and fine-grained understanding of the surrounding environment, yet single-modality methods face trade-offs between camera semantics and LiDAR geometry. Existing multi-modal frameworks often struggle with modality heterogeneity, spatial misalignment, and the representation crisis--where voxels are computationally heavy and BEV alternatives are lossy. We present GaussianOcc3D, a multi-modal framework bridging camera and LiDAR through a memory-efficient, continuous 3D Gaussian representation. We introduce four modules: (1) LiDAR Depth Feature Aggregation (LDFA), using depth-wise deformable sampling to lift sparse signals onto Gaussian primitives; (2) Entropy-Based Feature Smoothing (EBFS) to mitigate domain noise; (3) Adaptive Camera-LiDAR Fusion (ACLF) with uncertainty-aware reweighting for sensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Optical Sensing Technologies
