RCGDet3D: Rethinking 4D Radar-Camera Fusion-based 3D Object Detection with Enhanced Radar Feature Encoding
Weiyi Xiong, Bing Zhu

TL;DR
This paper introduces RCGDet3D, a real-time 3D object detection method that enhances radar feature encoding and simplifies fusion, outperforming existing approaches in accuracy and speed.
Contribution
It proposes a novel radar feature encoding approach with Ray-centric Gaussian encoding and semantic injection, enabling efficient and accurate 3D detection.
Findings
Outperforms state-of-the-art methods in accuracy and speed
Achieves comparable or higher performance with simpler fusion
Sets new benchmarks for real-time 3D detection
Abstract
4D automotive radar is indispensable for autonomous driving due to its low cost and robustness, yet its point cloud sparsity challenges 3D object detection. Existing 4D radar-camera fusion methods focus on complex fusion strategies, trading inference speed for marginal gains. This trade-off hinders real-time deployment due to heavy computation on dense feature maps. In contrast, feature extraction from sparse radar points is less time-consuming but remains under-explored. This work uncovers that simply enhancing radar feature extraction can achieve comparable or even higher performance than elaborate fusion modules, while maintaining real-time performance. Based on this finding, we propose RCGDet3D, which centers on radar feature encoding and simplifies multi-modal fusion. Its encoder inherits from the efficient Gaussian Splatting-based Point Gaussian Encoder (PGE) in RadarGaussianDet3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
