An Adaptive Device-Edge Co-Inference Framework Based on Soft Actor-Critic
Tao Niu, Yinglei Teng, Zhu Han, Panpan Zou

TL;DR
This paper introduces an adaptive co-inference framework for DNNs on IoT devices, using a novel DRL optimizer to balance communication and computation, demonstrated with real-world experiments.
Contribution
It proposes a systematic on-demand co-inference framework with a DRL-based optimizer for dynamic device-edge collaboration in DNN execution.
Findings
Outperforms existing methods in latency and accuracy.
Supports 5G URLLC environments effectively.
Demonstrated on Raspberry Pi 4 and PC with real-world tests.
Abstract
Recently, the applications of deep neural network (DNN) have been very prominent in many fields such as computer vision (CV) and natural language processing (NLP) due to its superior feature extraction performance. However, the high-dimension parameter model and large-scale mathematical calculation restrict the execution efficiency, especially for Internet of Things (IoT) devices. Different from the previous cloud/edge-only pattern that brings huge pressure for uplink communication and device-only fashion that undertakes unaffordable calculation strength, we highlight the collaborative computation between the device and edge for DNN models, which can achieve a good balance between the communication load and execution accuracy. Specifically, a systematic on-demand co-inference framework is proposed to exploit the multi-branch structure, in which the pre-trained Alexnet is right-sized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Brain Tumor Detection and Classification · Neural Networks and Reservoir Computing
Methodspc · Attentive Walk-Aggregating Graph Neural Network
