Real-Time And Robust 3D Object Detection with Roadside LiDARs
Walter Zimmer, Jialong Wu, Xingcheng Zhou, Alois C. Knoll

TL;DR
This paper presents a real-time, robust 3D object detection model using roadside LiDARs, improving accuracy and domain adaptation for smart city applications in autonomous driving.
Contribution
It introduces a novel 3D detector that enhances existing models with improved accuracy and domain transfer capabilities, operating at 45 Hz for real-time roadside perception.
Findings
Outperforms baseline models significantly in accuracy
Operates at 45 Hz inference speed
Effective domain adaptation from Chinese to German datasets
Abstract
This work aims to address the challenges in autonomous driving by focusing on the 3D perception of the environment using roadside LiDARs. We design a 3D object detection model that can detect traffic participants in roadside LiDARs in real-time. Our model uses an existing 3D detector as a baseline and improves its accuracy. To prove the effectiveness of our proposed modules, we train and evaluate the model on three different vehicle and infrastructure datasets. To show the domain adaptation ability of our detector, we train it on an infrastructure dataset from China and perform transfer learning on a different dataset recorded in Germany. We do several sets of experiments and ablation studies for each module in the detector that show that our model outperforms the baseline by a significant margin, while the inference speed is at 45 Hz (22 ms). We make a significant contribution with our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Infrastructure Maintenance and Monitoring
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
