DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices
Xueyu Hou, Yongjie Guan, Tao Han, Ning Zhang

TL;DR
DistrEdge is a deep reinforcement learning-based method that optimizes CNN inference distribution across heterogeneous edge devices, significantly improving speedup and adaptability in practical IoT environments.
Contribution
It introduces a general, adaptive CNN inference distribution strategy for heterogeneous edge devices using deep reinforcement learning.
Findings
Achieves 1.1 to 3x speedup over existing methods.
Effectively adapts to device heterogeneity and network conditions.
Demonstrates practical applicability on embedded AI devices.
Abstract
As the number of edge devices with computing resources (e.g., embedded GPUs, mobile phones, and laptops) increases, recent studies demonstrate that it can be beneficial to collaboratively run convolutional neural network (CNN) inference on more than one edge device. However, these studies make strong assumptions on the devices' conditions, and their application is far from practical. In this work, we propose a general method, called DistrEdge, to provide CNN inference distribution strategies in environments with multiple IoT edge devices. By addressing heterogeneity in devices, network conditions, and nonlinear characters of CNN computation, DistrEdge is adaptive to a wide range of cases (e.g., with different network conditions, various device types) using deep reinforcement learning technology. We utilize the latest embedded AI computing devices (e.g., NVIDIA Jetson products) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices · Adversarial Robustness in Machine Learning
