Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection   on Hailo-8L

Woong-Chan Byun; Dong-Hee Paek; Seung-Hyun Song; Seung-Hyun Kong

arXiv:2505.00757·cs.CV·May 5, 2025

Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L

Woong-Chan Byun, Dong-Hee Paek, Seung-Hyun Song, Seung-Hyun Kong

PDF

Open Access

TL;DR

This paper presents the first on-chip implementation of 4D radar-based 3D object detection on the Hailo-8L, enabling real-time, low-power autonomous driving perception with comparable accuracy to GPU systems.

Contribution

It introduces a tensor transformation method to adapt 5D CNN inputs for 4D tensor support on Hailo-8L, facilitating efficient on-chip deployment.

Findings

01

Achieves 46.47% AP_3D and 52.75% AP_BEV accuracy

02

Runs at 13.76 Hz inference speed

03

Maintains accuracy comparable to GPU-based models

Abstract

4D radar has attracted attention in autonomous driving due to its ability to enable robust 3D object detection even under adverse weather conditions. To practically deploy such technologies, it is essential to achieve real-time processing within low-power embedded environments. Addressing this, we present the first on-chip implementation of a 4D radar-based 3D object detection model on the Hailo-8L AI accelerator. Although conventional 3D convolutional neural network (CNN) architectures require 5D inputs, the Hailo-8L only supports 4D tensors, posing a significant challenge. To overcome this limitation, we introduce a tensor transformation method that reshapes 5D inputs into 4D formats during the compilation process, enabling direct deployment without altering the model structure. The proposed system achieves 46.47% AP_3D and 52.75% AP_BEV, maintaining comparable accuracy to GPU-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntegrated Circuits and Semiconductor Failure Analysis · CCD and CMOS Imaging Sensors · Physical Unclonable Functions (PUFs) and Hardware Security

MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings