DistrEE: Distributed Early Exit of Deep Neural Network Inference on Edge Devices
Xian Peng, Xin Wu, Lianming Xu, Li Wang, Aiguo Fei

TL;DR
DistrEE is a framework that enables distributed deep neural network inference on edge devices with early exit strategies to optimize latency and accuracy trade-offs in resource-constrained environments.
Contribution
It introduces a novel distributed inference framework with early exit policies, combining model early exit and collaborative inference for edge devices.
Findings
Reduces inference latency while maintaining accuracy
Efficiently balances resource usage and inference quality
Demonstrates effectiveness through extensive simulations
Abstract
Distributed DNN inference is becoming increasingly important as the demand for intelligent services at the network edge grows. By leveraging the power of distributed computing, edge devices can perform complicated and resource-hungry inference tasks previously only possible on powerful servers, enabling new applications in areas such as autonomous vehicles, industrial automation, and smart homes. However, it is challenging to achieve accurate and efficient distributed edge inference due to the fluctuating nature of the actual resources of the devices and the processing difficulty of the input data. In this work, we propose DistrEE, a distributed DNN inference framework that can exit model inference early to meet specific quality of service requirements. In particular, the framework firstly integrates model early exit and distributed inference for multi-node collaborative inferencing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI) · Privacy-Preserving Technologies in Data
Methodstravel james
