Exploring the Impact of Virtualization on the Usability of the Deep Learning Applications
Davood G. Samani, Mohsen Amini Salehi

TL;DR
This paper investigates how virtualization platforms, resource configurations, and hardware choices affect the end-to-end inference time of deep learning applications in cloud and edge environments, providing practical deployment insights.
Contribution
It offers a comprehensive analysis of virtualization impacts on DL application performance, highlighting counter-intuitive findings and best practices for deployment optimization.
Findings
Virtualization platform choice significantly influences inference time.
Resource elasticity and CPU pinning affect virtualization overhead.
Application characteristics are crucial for optimal deployment decisions.
Abstract
Deep Learning-based (DL) applications are becoming increasingly popular and advancing at an unprecedented pace. While many research works are being undertaken to enhance Deep Neural Networks (DNN) -- the centerpiece of DL applications -- practical deployment challenges of these applications in the Cloud and Edge systems, and their impact on the usability of the applications have not been sufficiently investigated. In particular, the impact of deploying different virtualization platforms, offered by the Cloud and Edge, on the usability of DL applications (in terms of the End-to-End (E2E) inference time) has remained an open question. Importantly, resource elasticity (by means of scale-up), CPU pinning, and processor type (CPU vs GPU) configurations have shown to be influential on the virtualization overhead. Accordingly, the goal of this research is to study the impact of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Advanced Neural Network Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
