Performance landscape of resource-constrained platforms targeting DNNs
Panagiotis Miliadis, Christos-Savvas Bouganis, Dionisios Pnevmatikatos

TL;DR
This paper evaluates the performance of modern embedded platforms running state-of-the-art DNNs across various vision tasks, providing insights into latency, throughput, and optimal batch sizes for resource-constrained devices.
Contribution
It offers a comprehensive performance analysis of commodity embedded platforms with vendor SDKs on advanced DNN models, highlighting their efficiency and optimal configurations.
Findings
Modern embedded systems achieve maximum performance with modest batch sizes.
Latency gains vary significantly across different platforms and models.
The study provides practical performance benchmarks for real-world deployment.
Abstract
Over the recent years, a significant number of complex, deep neural networks have been developed for a variety of applications including speech and face recognition, computer vision in the areas of health-care, automatic translation, image classification, etc. Moreover, there is an increasing demand in deploying these networks in resource-constrained edge devices. As the computational demands of these models keep increasing, pushing to their limits the targeted devices, the constant development of new hardware systems tailored to those workloads has been observed. Since programmability of these diverse and complex platforms -- compounded by the rapid development of new DNN models -- is a major challenge, platform vendors have developed Machine Learning tailored SDKs to maximize the platform's performance. This work investigates the performance achieved on a number of modern commodity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Advanced Image and Video Retrieval Techniques
