NEUKONFIG: Reducing Edge Service Downtime When Repartitioning DNNs

Ayesha Abdul Majeed; Peter Kilpatrick; Ivor Spence; Blesson; Varghese

arXiv:2106.15689·cs.DC·July 1, 2021

NEUKONFIG: Reducing Edge Service Downtime When Repartitioning DNNs

Ayesha Abdul Majeed, Peter Kilpatrick, Ivor Spence, Blesson, Varghese

PDF

TL;DR

This paper introduces NEUKONFIG, a framework that significantly reduces edge service downtime during DNN repartitioning by using dynamic switching, enabling near-zero downtime with increased memory usage.

Contribution

The paper proposes a novel dynamic switching approach for DNN repartitioning that minimizes edge service downtime compared to traditional pause-resume methods.

Findings

01

Reduces downtime from 6 seconds to less than 1 millisecond with increased memory.

02

Demonstrates effectiveness on a lab-based testbed.

03

Offers a trade-off between memory usage and downtime.

Abstract

Deep Neural Networks (DNNs) may be partitioned across the edge and the cloud to improve the performance efficiency of inference. DNN partitions are determined based on operational conditions such as network speed. When operational conditions change DNNs will need to be repartitioned to maintain the overall performance. However, repartitioning using existing approaches, such as Pause and Resume, will incur a service downtime on the edge. This paper presents the NEUKONFIG framework that identifies the service downtime incurred when repartitioning DNNs and proposes approaches for reducing edge service downtime. The proposed approaches are based on 'Dynamic Switching' in which, when the network speed changes and given an existing edge-cloud pipeline, a new edge-cloud pipeline is initialised with new DNN partitions. Incoming inference requests are switched to the new pipeline for processing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.