DistrEdge: Speeding up Convolutional Neural Network Inference on   Distributed Edge Devices

Xueyu Hou; Yongjie Guan; Tao Han; Ning Zhang

arXiv:2202.01699·cs.DC·February 9, 2022

DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices

Xueyu Hou, Yongjie Guan, Tao Han, Ning Zhang

PDF

Open Access

TL;DR

DistrEdge is a deep reinforcement learning-based method that optimizes CNN inference distribution across heterogeneous edge devices, significantly improving speedup and adaptability in practical IoT environments.

Contribution

It introduces a general, adaptive CNN inference distribution strategy for heterogeneous edge devices using deep reinforcement learning.

Findings

01

Achieves 1.1 to 3x speedup over existing methods.

02

Effectively adapts to device heterogeneity and network conditions.

03

Demonstrates practical applicability on embedded AI devices.

Abstract

As the number of edge devices with computing resources (e.g., embedded GPUs, mobile phones, and laptops) increases, recent studies demonstrate that it can be beneficial to collaboratively run convolutional neural network (CNN) inference on more than one edge device. However, these studies make strong assumptions on the devices' conditions, and their application is far from practical. In this work, we propose a general method, called DistrEdge, to provide CNN inference distribution strategies in environments with multiple IoT edge devices. By addressing heterogeneity in devices, network conditions, and nonlinear characters of CNN computation, DistrEdge is adaptive to a wide range of cases (e.g., with different network conditions, various device types) using deep reinforcement learning technology. We utilize the latest embedded AI computing devices (e.g., NVIDIA Jetson products) to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices · Adversarial Robustness in Machine Learning