Distributed Inference on Mobile Edge and Cloud: A Data-Cartography based Clustering Approach
Divya Jyoti Bajpai, Manjesh Kumar Hanawal

TL;DR
This paper presents ore, a novel distributed inference method using data cartography to classify sample complexity, enabling resource-efficient DNN deployment across mobile, edge, and cloud platforms with minimal accuracy loss.
Contribution
It introduces ore, a new approach that leverages data cartography to determine sample complexity for optimized distributed inference in resource-constrained environments.
Findings
Reduces inference costs by over 43%
Maintains accuracy drop below 0.5%
Effective across various NLP tasks
Abstract
The large size of DNNs poses a significant challenge for deployment on devices with limited resources, such as mobile, edge, and IoT platforms. To address this issue, a distributed inference framework can be utilized. In this framework, a small-scale DNN (initial layers) is deployed on mobile devices, a larger version on edge devices, and the full DNN on the cloud. Samples with low complexity (easy) can be processed on mobile, those with moderate complexity (medium) on edge devices, and high complexity (hard) samples on the cloud. Given that the complexity of each sample is unknown in advance, the crucial question in distributed inference is determining the sample complexity for appropriate DNN processing. We introduce a novel method named \our{}, which leverages the Data Cartography approach initially proposed for enhancing DNN generalization. By employing data cartography, we assess…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Complex Network Analysis Techniques · Recommender Systems and Techniques
