Early-Exit meets Model-Distributed Inference at Edge Networks

Marco Colocrese; Erdem Koyuncu; Hulya Seferoglu

arXiv:2408.05247·cs.DC·August 13, 2024

Early-Exit meets Model-Distributed Inference at Edge Networks

Marco Colocrese, Erdem Koyuncu, Hulya Seferoglu

PDF

Open Access

TL;DR

This paper introduces MDI-Exit, a framework combining model-distributed inference with early-exit strategies to reduce computation and communication costs at edge networks, improving efficiency and accuracy.

Contribution

It proposes a novel adaptive framework for early-exit in model-distributed inference, optimizing offloading and data admission at the edge.

Findings

01

Higher data processing throughput at fixed accuracy.

02

Improved accuracy for given data rates.

03

Effective early-exit policy on NVIDIA Nano devices.

Abstract

Distributed inference techniques can be broadly classified into data-distributed and model-distributed schemes. In data-distributed inference (DDI), each worker carries the entire deep neural network (DNN) model but processes only a subset of the data. However, feeding the data to workers results in high communication costs, especially when the data is large. An emerging paradigm is model-distributed inference (MDI), where each worker carries only a subset of DNN layers. In MDI, a source device that has data processes a few layers of DNN and sends the output to a neighboring device, i.e., offloads the rest of the layers. This process ends when all layers are processed in a distributed manner. In this paper, we investigate the design and development of MDI with early-exit, which advocates that there is no need to process all the layers of a model for some data to reach the desired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Advanced Graph Neural Networks · Opinion Dynamics and Social Influence