Deployment of Deep Neural Networks for Object Detection on Edge AI   Devices with Runtime Optimization

Lukas St\"acker; Juncong Fei; Philipp Heidenreich; Frank Bonarens,; Jason Rambach; Didier Stricker; and Christoph Stiller

arXiv:2108.08166·cs.CV·August 19, 2021

Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization

Lukas St\"acker, Juncong Fei, Philipp Heidenreich, Frank Bonarens,, Jason Rambach, Didier Stricker, and Christoph Stiller

PDF

TL;DR

This paper examines deploying deep neural networks for object detection on edge AI devices, focusing on runtime optimization, conversion processes, and the impact of quantization on performance and efficiency.

Contribution

It provides a case study on deploying RetinaNet and PointPillars, detailing necessary modifications, evaluation of runtime libraries, and the effects of quantization for edge deployment.

Findings

01

TensorRT slightly better for convolutional layers

02

TorchScript more efficient for fully connected layers

03

Quantization reduces runtime with minimal performance loss

Abstract

Deep neural networks have proven increasingly important for automotive scene understanding with new algorithms offering constant improvements of the detection performance. However, there is little emphasis on experiences and needs for deployment in embedded environments. We therefore perform a case study of the deployment of two representative object detection networks on an edge AI platform. In particular, we consider RetinaNet for image-based 2D object detection and PointPillars for LiDAR-based 3D object detection. We describe the modifications necessary to convert the algorithms from a PyTorch training environment to the deployment environment taking into account the available tools. We evaluate the runtime of the deployed DNN using two different libraries, TensorRT and TorchScript. In our experiments, we observe slight advantages of TensorRT for convolutional layers and TorchScript…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFeature Pyramid Network · Focal Loss · 1x1 Convolution · Convolution · RetinaNet