A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data

Kavin Chandrasekaran; Sorin Grigorescu; Gijs Dubbelman; Pavol Jancura

arXiv:2411.13311·cs.CV·May 13, 2026

A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data

Kavin Chandrasekaran, Sorin Grigorescu, Gijs Dubbelman, Pavol Jancura

PDF

TL;DR

This paper proposes a resource-efficient fusion network that combines camera BEV features and raw radar RD spectrum data for improved object detection in autonomous driving, emphasizing low computational cost.

Contribution

It introduces a novel fusion approach that directly utilizes raw radar data and BEV camera features, reducing processing complexity while maintaining detection accuracy.

Findings

01

Fusion of BEV camera features with raw radar data improves detection performance.

02

The proposed method achieves competitive accuracy with lower computational complexity.

03

Evaluation on RADIal dataset demonstrates effectiveness over existing methods.

Abstract

Cameras can be used to perceive the environment around the vehicle, while affordable radar sensors are popular in autonomous driving systems as they can withstand adverse weather conditions unlike cameras. However, radar point clouds are sparser with low azimuth and elevation resolution that lack semantic and structural information of the scenes, resulting in generally lower radar detection performance. In this work, we directly use the raw range-Doppler (RD) spectrum of radar data, thus avoiding radar signal processing. We independently process camera images within the proposed comprehensive image processing pipeline. Specifically, first, we transform the camera images to Bird's-Eye View (BEV) Polar domain and extract the corresponding features with our camera encoder-decoder architecture. The resultant feature maps are fused with Range-Azimuth (RA) features, recovered from the RD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.