LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using   LiDAR and Camera

Yukai Ma; Jianbiao Mei; Xuemeng Yang; Licheng Wen; Weihua Xu,; Jiangning Zhang; Botian Shi; Yong Liu; Xingxing Zuo

arXiv:2407.16197·cs.CV·July 24, 2024

LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera

Yukai Ma, Jianbiao Mei, Xuemeng Yang, Licheng Wen, Weihua Xu,, Jiangning Zhang, Botian Shi, Yong Liu, Xingxing Zuo

PDF

TL;DR

This paper introduces LiCROcc, a novel multi-modal fusion framework using radar, LiDAR, and camera data to improve semantic scene completion robustness and accuracy in autonomous driving, especially under adverse weather conditions.

Contribution

It proposes a three-stage BEV fusion architecture and cross-modal distillation modules to enhance radar-only and radar-camera SSC performance, surpassing baseline methods.

Findings

01

R-LiCROcc achieves 22.9% mIOU improvement over baseline.

02

RC-LiCROcc surpasses baseline by 44.1% mIOU.

03

The approach demonstrates robustness against weather and illumination changes.

Abstract

Semantic Scene Completion (SSC) is pivotal in autonomous driving perception, frequently confronted with the complexities of weather and illumination changes. The long-term strategy involves fusing multi-modal information to bolster the system's robustness. Radar, increasingly utilized for 3D target detection, is gradually replacing LiDAR in autonomous driving applications, offering a robust sensing alternative. In this paper, we focus on the potential of 3D radar in semantic scene completion, pioneering cross-modal refinement techniques for improved robustness against weather and illumination changes, and enhancing SSC performance.Regarding model architecture, we propose a three-stage tight fusion approach on BEV to realize a fusion framework for point clouds and images. Based on this foundation, we designed three cross-modal distillation modules-CMRD, BRD, and PDD. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus