Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L
Woong-Chan Byun, Dong-Hee Paek, Seung-Hyun Song, Seung-Hyun Kong

TL;DR
This paper presents the first on-chip implementation of 4D radar-based 3D object detection on the Hailo-8L, enabling real-time, low-power autonomous driving perception with comparable accuracy to GPU systems.
Contribution
It introduces a tensor transformation method to adapt 5D CNN inputs for 4D tensor support on Hailo-8L, facilitating efficient on-chip deployment.
Findings
Achieves 46.47% AP_3D and 52.75% AP_BEV accuracy
Runs at 13.76 Hz inference speed
Maintains accuracy comparable to GPU-based models
Abstract
4D radar has attracted attention in autonomous driving due to its ability to enable robust 3D object detection even under adverse weather conditions. To practically deploy such technologies, it is essential to achieve real-time processing within low-power embedded environments. Addressing this, we present the first on-chip implementation of a 4D radar-based 3D object detection model on the Hailo-8L AI accelerator. Although conventional 3D convolutional neural network (CNN) architectures require 5D inputs, the Hailo-8L only supports 4D tensors, posing a significant challenge. To overcome this limitation, we introduce a tensor transformation method that reshapes 5D inputs into 4D formats during the compilation process, enabling direct deployment without altering the model structure. The proposed system achieves 46.47% AP_3D and 52.75% AP_BEV, maintaining comparable accuracy to GPU-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntegrated Circuits and Semiconductor Failure Analysis · CCD and CMOS Imaging Sensors · Physical Unclonable Functions (PUFs) and Hardware Security
MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
