Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization
Lukas St\"acker, Juncong Fei, Philipp Heidenreich, Frank Bonarens,, Jason Rambach, Didier Stricker, and Christoph Stiller

TL;DR
This paper examines deploying deep neural networks for object detection on edge AI devices, focusing on runtime optimization, conversion processes, and the impact of quantization on performance and efficiency.
Contribution
It provides a case study on deploying RetinaNet and PointPillars, detailing necessary modifications, evaluation of runtime libraries, and the effects of quantization for edge deployment.
Findings
TensorRT slightly better for convolutional layers
TorchScript more efficient for fully connected layers
Quantization reduces runtime with minimal performance loss
Abstract
Deep neural networks have proven increasingly important for automotive scene understanding with new algorithms offering constant improvements of the detection performance. However, there is little emphasis on experiences and needs for deployment in embedded environments. We therefore perform a case study of the deployment of two representative object detection networks on an edge AI platform. In particular, we consider RetinaNet for image-based 2D object detection and PointPillars for LiDAR-based 3D object detection. We describe the modifications necessary to convert the algorithms from a PyTorch training environment to the deployment environment taking into account the available tools. We evaluate the runtime of the deployed DNN using two different libraries, TensorRT and TorchScript. In our experiments, we observe slight advantages of TensorRT for convolutional layers and TorchScript…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Pyramid Network · Focal Loss · 1x1 Convolution · Convolution · RetinaNet
