Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial   Transformation

Zhuangwei Zhuang; Ziyin Wang; Sitao Chen; Lizhao Liu; Hui Luo; Mingkui; Tan

arXiv:2411.12177·cs.CV·November 20, 2024

Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation

Zhuangwei Zhuang, Ziyin Wang, Sitao Chen, Lizhao Liu, Hui Luo, Mingkui, Tan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a calibration-free, attention-based spatial transformation method for 3D semantic occupancy prediction, improving robustness and efficiency in autonomous driving environments.

Contribution

It proposes a novel calibration-free spatial transformation using attention mechanisms, along with auxiliary training tasks and a query-based prediction scheme, enhancing accuracy and speed.

Findings

01

Outperforms existing methods on multiple benchmarks

02

Achieves 19.8× speedup over Co-Occ

03

Improves geometry IoU by 1.1 on OpenOccupancy

Abstract

3D semantic occupancy prediction, which seeks to provide accurate and comprehensive representations of environment scenes, is important to autonomous driving systems. For autonomous cars equipped with multi-camera and LiDAR, it is critical to aggregate multi-sensor information into a unified 3D space for accurate and robust predictions. Recent methods are mainly built on the 2D-to-3D transformation that relies on sensor calibration to project the 2D image information into the 3D space. These methods, however, suffer from two major limitations: First, they rely on accurate sensor calibration and are sensitive to the calibration noise, which limits their application in real complex environments. Second, the spatial transformation layers are computationally expensive and limit their running on an autonomous vehicle. In this work, we attempt to exploit a Robust and Efficient 3D semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iceory/reo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Advanced Image and Video Retrieval Techniques · Handwritten Text Recognition Techniques

MethodsSoftmax · Attention Is All You Need