SUG-Occ: Explicit Semantics and Uncertainty Guided Sparse Learning for Efficient 3D Occupancy Prediction
Hanlin Wu, Pengfei Lin, Ehsan Javanmardi, Naren Bao, Bo Qian, Hao Si, Manabu Tsukada

TL;DR
SUG-Occ introduces a sparse learning framework for 3D occupancy prediction that leverages explicit semantics and uncertainty to improve efficiency and accuracy in autonomous driving scenarios.
Contribution
The paper presents a novel sparse learning approach utilizing semantic and uncertainty priors, explicit unsigned distance encoding, and an object contextual representation for efficient 3D occupancy prediction.
Findings
Outperforms baselines on SemanticKITTI and Occ3D-Nuscenes datasets.
Achieves better accuracy and efficiency in 3D occupancy prediction.
Effectively exploits scene sparsity to reduce computational cost.
Abstract
3D semantic occupancy prediction has emerged as a critical perception task for autonomous driving due to its ability to offer voxel-level semantic and geometric understanding of the environment. However, such a refined representation for large-scale scenes incurs prohibitive computation, posing a significant challenge to practical real-time deployment. To address this, we propose SUGOcc, an explicit semantics and uncertainty guided sparse learning framework for efficient occupancy prediction, which exploits the inherent sparsity of 3D scenes to reduce redundant computation while maintaining geometric and semantic integrity. Specifically, we first utilize semantic and uncertainty priors to suppress image projections from free space while employing explicit unsigned distance encoding to enhance geometric consistency, thereby producing a structurally sparse representation. Secondly, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
