OccMamba: Semantic Occupancy Prediction with State Space Models
Heng Li, Yuenan Hou, Xiaohan Xing, Yuexin Ma, Xiao Sun, Yanyong Zhang

TL;DR
OccMamba introduces a novel Mamba-based neural network architecture for semantic occupancy prediction that efficiently processes large 3D scene data, outperforming transformer-based models in accuracy and computational complexity.
Contribution
This paper presents the first Mamba architecture for semantic occupancy prediction, incorporating hierarchical modules and a 3D-to-1D reordering scheme for improved efficiency and accuracy.
Findings
Achieves state-of-the-art performance on OpenOccupancy, SemanticKITTI, and SemanticPOSS benchmarks.
Outperforms previous methods by 5.1% IoU and 4.3% mIoU on OpenOccupancy.
Demonstrates efficient processing of dense 3D scene grids with linear complexity.
Abstract
Training deep learning models for semantic occupancy prediction is challenging due to factors such as a large number of occupancy cells, severe occlusion, limited visual cues, complicated driving scenarios, etc. Recent methods often adopt transformer-based architectures given their strong capability in learning input-conditioned weights and long-range relationships. However, transformer-based networks are notorious for their quadratic computation complexity, seriously undermining their efficacy and deployment in semantic occupancy prediction. Inspired by the global modeling and linear computation complexity of the Mamba architecture, we present the first Mamba-based network for semantic occupancy prediction, termed OccMamba. Specifically, we first design the hierarchical Mamba module and local context processor to better aggregate global and local contextual information, respectively.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
