TL;DR
This paper explores the importance of adaptive deep neural networks in edge computing, showing that network conditions significantly impact performance and that dynamic partitioning can improve efficiency.
Contribution
It provides an empirical analysis demonstrating the sensitivity of DNNs to network conditions and advocates for adaptive DNNs to optimize performance in edge environments.
Findings
Network conditions affect DNN performance more than CPU or memory.
Repartitioning DNNs can improve performance in certain scenarios.
No clear correlation between hardware architecture and optimal DNN partitioning.
Abstract
Edge computing offers an additional layer of compute infrastructure closer to the data source before raw data from privacy-sensitive and performance-critical applications is transferred to a cloud data center. Deep Neural Networks (DNNs) are one class of applications that are reported to benefit from collaboratively computing between the edge and the cloud. A DNN is partitioned such that specific layers of the DNN are deployed onto the edge and the cloud to meet performance and privacy objectives. However, there is limited understanding of: (a) whether and how evolving operational conditions (increased CPU and memory utilization at the edge or reduced data transfer rates between the edge and the cloud) affect the performance of already deployed DNNs, and (b) whether a new partition configuration is required to maximize performance. A DNN that adapts to changing operational conditions is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
