Complexity-aware Adaptive Training and Inference for Edge-Cloud   Distributed AI Systems

Yinghan Long; Indranil Chakraborty; Gopalakrishnan Srinivasan; Kaushik; Roy

arXiv:2109.06440·cs.LG·September 15, 2021·1 cites

Complexity-aware Adaptive Training and Inference for Edge-Cloud Distributed AI Systems

Yinghan Long, Indranil Chakraborty, Gopalakrishnan Srinivasan, Kaushik, Roy

PDF

Open Access

TL;DR

This paper introduces MEANet, a distributed AI architecture that intelligently allocates inference tasks between edge devices and the cloud, optimizing accuracy and energy efficiency for IoT applications.

Contribution

The paper presents MEANet, a novel architecture with adaptive inference and training techniques that effectively balance edge and cloud processing for complex data.

Findings

01

Improved accuracy on CIFAR-100 and ImageNet datasets.

02

Reduced energy consumption during inference.

03

Effective classification of easy, hard, and complex data instances.

Abstract

The ubiquitous use of IoT and machine learning applications is creating large amounts of data that require accurate and real-time processing. Although edge-based smart data processing can be enabled by deploying pretrained models, the energy and memory constraints of edge devices necessitate distributed deep learning between the edge and the cloud for complex data. In this paper, we propose a distributed AI system to exploit both the edge and the cloud for training and inference. We propose a new architecture, MEANet, with a main block, an extension block, and an adaptive block for the edge. The inference process can terminate at either the main block, the extension block, or the cloud. The MEANet is trained to categorize inputs into easy/hard/complex classes. The main block identifies instances of easy/hard classes and classifies easy classes with high confidence. Only data with high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · IoT and Edge/Fog Computing · Machine Learning and ELM

MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Batch Normalization · 1x1 Convolution · Inverted Residual Block · Convolution · Average Pooling