Optimal Accuracy-Time Trade-off for Deep Learning Services in Edge Computing Systems
Minoo Hosseinzadeh, Andrew Wachal, Hana Khamfroush, Daniel E. Lucani

TL;DR
This paper addresses the challenge of balancing accuracy and latency for deep learning services in edge computing by proposing a near-optimal scheduling algorithm that improves user satisfaction significantly.
Contribution
It formulates the accuracy-time trade-off as an NP-hard problem and introduces a polynomial-time greedy algorithm, GUS, that achieves near-optimal solutions for edge computing platforms.
Findings
GUS outperforms baseline heuristics by at least 50% in user satisfaction.
The problem is proven NP-hard, justifying the need for approximation algorithms.
Numerical and real-world tests validate GUS's effectiveness.
Abstract
With the increasing demand for computationally intensive services like deep learning tasks, emerging distributed computing platforms such as edge computing (EC) systems are becoming more popular. Edge computing systems have shown promising results in terms of latency reduction compared to the traditional cloud systems. However, their limited processing capacity imposes a trade-off between the potential latency reduction and the achieved accuracy in computationally-intensive services such as deep learning-based services. In this paper, we focus on finding the optimal accuracy-time trade-off for running deep learning services in a three-tier EC platform where several deep learning models with different accuracy levels are available. Specifically, we cast the problem as an Integer Linear Program, where optimal task scheduling decisions are made to maximize overall user satisfaction in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
