Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Zhuangwei Zhuang, Ziyin Wang, Sitao Chen, Lizhao Liu, Hui Luo, Mingkui, Tan

TL;DR
This paper introduces a calibration-free, attention-based spatial transformation method for 3D semantic occupancy prediction, improving robustness and efficiency in autonomous driving environments.
Contribution
It proposes a novel calibration-free spatial transformation using attention mechanisms, along with auxiliary training tasks and a query-based prediction scheme, enhancing accuracy and speed.
Findings
Outperforms existing methods on multiple benchmarks
Achieves 19.8× speedup over Co-Occ
Improves geometry IoU by 1.1 on OpenOccupancy
Abstract
3D semantic occupancy prediction, which seeks to provide accurate and comprehensive representations of environment scenes, is important to autonomous driving systems. For autonomous cars equipped with multi-camera and LiDAR, it is critical to aggregate multi-sensor information into a unified 3D space for accurate and robust predictions. Recent methods are mainly built on the 2D-to-3D transformation that relies on sensor calibration to project the 2D image information into the 3D space. These methods, however, suffer from two major limitations: First, they rely on accurate sensor calibration and are sensitive to the calibration noise, which limits their application in real complex environments. Second, the spatial transformation layers are computationally expensive and limit their running on an autonomous vehicle. In this work, we attempt to exploit a Robust and Efficient 3D semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Image and Video Retrieval Techniques · Handwritten Text Recognition Techniques
MethodsSoftmax · Attention Is All You Need
