Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
Jong Hwan Ko, Taesik Na, Mohammad Faisal Amir, Saibal Mukhopadhyay

TL;DR
This paper proposes a novel edge-host partitioning method for deep neural networks in IoT devices, using feature space encoding to improve energy efficiency and throughput by splitting inference tasks between edge and host platforms.
Contribution
It introduces a new partitioning approach with feature space encoding, enhancing IoT device performance by optimizing DNN inference distribution.
Findings
Partitioning at convolutional layers improves energy efficiency.
Feature space encoding increases throughput.
Significant performance gains over baseline methods.
Abstract
This paper introduces partitioning an inference task of a deep neural network between an edge and a host platform in the IoT environment. We present a DNN as an encoding pipeline, and propose to transmit the output feature space of an intermediate layer to the host. The lossless or lossy encoding of the feature space is proposed to enhance the maximum input rate supported by the edge platform and/or reduce the energy of the edge platform. Simulation results show that partitioning a DNN at the end of convolutional (feature extraction) layers coupled with feature space encoding enables significant improvement in the energy-efficiency and throughput over the baseline configurations that perform the entire inference at the edge or at the host.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
