TL;DR
Scission is an automated tool that optimizes the distribution of deep neural networks across devices, edge, and cloud resources by considering hardware capabilities, network conditions, and user constraints to maximize performance.
Contribution
The paper introduces Scission, a novel automated benchmarking tool that determines optimal DNN partitioning strategies considering multiple contextual factors.
Findings
Scission effectively identifies optimal DNN partitions for diverse hardware configurations.
Experimental results on 18 DNNs demonstrate performance improvements over manual partitioning.
Benchmarking overheads are manageable, enabling periodic re-optimization.
Abstract
Partitioning and distributing deep neural networks (DNNs) across end-devices, edge resources and the cloud has a potential twofold advantage: preserving privacy of the input data, and reducing the ingress bandwidth demand beyond the edge. However, for a given DNN, identifying the optimal partition configuration for distributing the DNN that maximizes performance is a significant challenge. This is because the combination of potential target hardware resources that maximizes performance and the sequence of layers of the DNN that should be distributed across the target resources needs to be determined, while accounting for user-defined objectives/constraints for partitioning. This paper presents Scission, a tool for automated benchmarking of DNNs on a given set of target device, edge and cloud resources for determining optimal partitions that maximize DNN performance. The decision-making…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
