Radar–Camera Fusion in Perspective View and Bird’s Eye View for 3D Object Detection
Yuhao Xiao, Xiaoqing Chen, Yingkai Wang, Zhongliang Fu

TL;DR
This paper introduces a radar-camera fusion method for 3D object detection that fuses features in both the perspective view and the bird's eye view, reporting 64.2 NDS and 56.3 mAP on the nuScenes dataset.
Contribution
The novel dual-view fusion paradigm improves depth estimation and 3D object detection accuracy using cross-modal attention and radar image generation.
Findings
The proposed method achieves state-of-the-art performance on the nuScenes dataset with 64.2 NDS and 56.3 mAP.
Fusing perspective and bird's eye views enhances image BEV feature precision through improved depth estimation.
A radar image generation module and cross-modal fusion module are effective in combining radar and camera features.
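To make the idea of a radar image more concrete, the following is a minimal sketch of how radar points could be rasterized into the camera's perspective view. It is an illustration under stated assumptions, not the paper's radar image generation module: the function name `radar_points_to_image`, the pinhole intrinsic matrix `K`, the channel layout (depth first, then per-point features), and the nearest-point-wins rule are all hypothetical choices made for this example.

```python
import numpy as np


def radar_points_to_image(points_cam, feats, K, height, width):
    """Rasterize radar points into a sparse perspective-view image (sketch).

    points_cam: (N, 3) x, y, z in the camera frame, z pointing forward (metres).
    feats:      (N, C) per-point features, e.g. RCS or radial velocity (assumed).
    K:          (3, 3) pinhole intrinsic matrix (assumed camera model).
    Returns an array of shape (C + 1, height, width); channel 0 holds depth.
    """
    points_cam = np.asarray(points_cam, dtype=float)
    feats = np.asarray(feats, dtype=float)
    K = np.asarray(K, dtype=float)
    image = np.zeros((feats.shape[1] + 1, height, width), dtype=float)

    # Keep only points in front of the camera.
    in_front = points_cam[:, 2] > 0
    pts, f = points_cam[in_front], feats[in_front]

    # Pinhole projection: [u*z, v*z, z]^T = K @ [x, y, z]^T.
    uvw = pts @ K.T
    u = np.floor(uvw[:, 0] / uvw[:, 2]).astype(int)
    v = np.floor(uvw[:, 1] / uvw[:, 2]).astype(int)

    # Discard points that fall outside the image bounds.
    inside = (u >= 0) & (u < width) & (v >= 0) & (v < height)
    u, v, z, f = u[inside], v[inside], pts[inside, 2], f[inside]

    # Write far points first so the nearest point overwrites others
    # that land on the same pixel (an assumed tie-breaking rule).
    order = np.argsort(-z)
    for ui, vi, zi, fi in zip(u[order], v[order], z[order], f[order]):
        image[0, vi, ui] = zi
        image[1:, vi, ui] = fi
    return image
```

The result is a sparse image aligned pixel-for-pixel with the camera frame, which is the kind of input a perspective-view fusion module can combine with camera features. Real radar returns carry little or no height information, so practical pipelines often extend each point vertically before rasterizing; that step is omitted here.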
Abstract
Three-dimensional object detection based on the fusion of millimeter-wave radar and cameras is gaining increasing attention owing to its low cost, high accuracy, and strong robustness. Recently, the bird’s eye view (BEV) fusion paradigm has dominated radar–camera fusion-based 3D object detection methods. In the BEV fusion paradigm, the detection accuracy is jointly determined by the precision of both image BEV features and radar BEV features. The precision of image BEV features is significantly influenced by depth estimation accuracy, yet estimating depth from a monocular image is an inherently challenging, ill-posed problem. In this article, we propose a novel approach to enhance depth estimation accuracy by fusing camera perspective view (PV) features and radar perspective view features, thereby improving the precision of image BEV features. The refined image BEV…
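The dependence of image BEV features on depth estimation can be illustrated with a small sketch of a depth-weighted view transform. This is not taken from the paper, and the truncated abstract does not specify its view transform. The sketch assumes a lift-and-splat style scheme, which is common in BEV pipelines: each pixel predicts a categorical distribution over depth bins, and its feature is scattered into the BEV grid weighted by that distribution. The function name `lift_to_bev`, the grid parameters, and the camera-frame BEV layout (x to the right, z forward) are hypothetical.

```python
import numpy as np


def lift_to_bev(feat, depth_logits, depth_bins, K, bev_range, bev_size):
    """Scatter perspective-view features into a BEV grid (sketch).

    feat:         (C, H, W) image features.
    depth_logits: (D, H, W) per-pixel scores over D depth bins.
    depth_bins:   (D,) depth of each bin in metres.
    K:            (3, 3) intrinsics at feature-map resolution (assumed pinhole).
    bev_range:    (x_min, x_max, z_min, z_max) in metres, camera frame.
    bev_size:     (nz, nx) number of BEV cells along z and x.
    Returns a (C, nz, nx) BEV feature map; the vertical axis is collapsed.
    """
    feat = np.asarray(feat, dtype=float)
    depth_logits = np.asarray(depth_logits, dtype=float)
    C, H, W = feat.shape
    x_min, x_max, z_min, z_max = bev_range
    nz, nx = bev_size

    # Softmax over the depth axis gives a per-pixel depth distribution.
    shifted = depth_logits - depth_logits.max(axis=0, keepdims=True)
    prob = np.exp(shifted)
    prob /= prob.sum(axis=0, keepdims=True)

    # Lateral ray slope x / z for each pixel centre.
    u = np.arange(W) + 0.5
    x_over_z = np.tile((u - K[0][2]) / K[0][0], (H, 1))  # (H, W)

    bev = np.zeros((C, nz, nx), dtype=float)
    for d, z in enumerate(depth_bins):
        iz = int(np.floor((z - z_min) / (z_max - z_min) * nz))
        if not 0 <= iz < nz:
            continue
        ix = np.floor((x_over_z * z - x_min) / (x_max - x_min) * nx).astype(int)
        valid = (ix >= 0) & (ix < nx)
        weighted = feat * prob[d]  # feature mass assigned to this depth bin
        for c in range(C):
            # np.add.at accumulates correctly when several pixels hit one cell.
            np.add.at(bev[c, iz], ix[valid], weighted[c][valid])
    return bev
```

In this sketch, a depth distribution that is spread over many bins smears each pixel's feature along its viewing ray, while a distribution that peaks at the correct bin places the feature in the right BEV cell. That is the sense in which more accurate depth yields more precise image BEV features.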
Taxonomy
Topics: Advanced SAR Imaging Techniques · Infrared Target Detection Methodologies · Sparse and Compressive Sensing Techniques
