CONTINUER: Maintaining Distributed DNN Services During Edge Failures
Ayesha Abdul Majeed, Peter Kilpatrick, Ivor Spence, Blesson, Varghese

TL;DR
CONTINUER is a framework that dynamically maintains distributed DNN services during edge node failures by estimating accuracy and latency, and selecting the optimal recovery technique to meet user-defined objectives.
Contribution
This paper introduces CONTINUER, a novel framework that estimates and selects the best technique to maintain distributed DNN services during edge failures, balancing accuracy, latency, and downtime.
Findings
Accurately estimates accuracy and latency with minimal error.
Effectively selects the best recovery technique with low overhead.
Maintains high accuracy up to 99.86% during failures.
Abstract
Partitioning and deploying Deep Neural Networks (DNNs) across edge nodes may be used to meet performance objectives of applications. However, the failure of a single node may result in cascading failures that will adversely impact the delivery of the service and will result in failure to meet specific objectives. The impact of these failures needs to be minimised at runtime. Three techniques are explored in this paper, namely repartitioning, early-exit and skip-connection. When an edge node fails, the repartitioning technique will repartition and redeploy the DNN thus avoiding the failed nodes. The early-exit technique makes provision for a request to exit (early) before the failed node. The skip connection technique dynamically routes the request by skipping the failed nodes. This paper will leverage trade-offs in accuracy, end-to-end latency and downtime for selecting the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Brain Tumor Detection and Classification
Methodstravel james
