PointPillars Backbone Type Selection For Fast and Accurate LiDAR Object Detection
Konrad Lis, Tomasz Kryjak

TL;DR
This paper investigates how different backbone neural network architectures affect the speed and accuracy of LiDAR-based 3D object detection using PointPillars, demonstrating significant speed improvements with minimal accuracy loss.
Contribution
It provides a comparative analysis of 10 neural network backbones for PointPillars, highlighting the trade-offs between detection accuracy and computational speed.
Findings
MobilenetV1 achieves nearly 4x speedup with 1.13% lower mAP.
CSPDarknet offers over 1.5x acceleration with a 0.33% increase in mAP.
Small decreases in detection efficiency enable faster processing suitable for embedded systems.
Abstract
3D object detection from LiDAR sensor data is an important topic in the context of autonomous cars and drones. In this paper, we present the results of experiments on the impact of backbone selection of a deep convolutional neural network on detection accuracy and computation speed. We chose the PointPillars network, which is characterised by a simple architecture, high speed, and modularity that allows for easy expansion. During the experiments, we paid particular attention to the change in detection efficiency (measured by the mAP metric) and the total number of multiply-addition operations needed to process one point cloud. We tested 10 different convolutional neural network architectures that are widely used in image-based detection problems. For a backbone like MobilenetV1, we obtained an almost 4x speedup at the cost of a 1.13% decrease in mAP. On the other hand, for CSPDarknet we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Autonomous Vehicle Technology and Safety
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
