AutoDiCE: Fully Automated Distributed CNN Inference at the Edge

Xiaotian Guo; Andy D.Pimentel; Todor Stefanov

arXiv:2207.12113·cs.DC·July 26, 2022

AutoDiCE: Fully Automated Distributed CNN Inference at the Edge

Xiaotian Guo, Andy D.Pimentel, Todor Stefanov

PDF

Open Access 1 Repo

TL;DR

AutoDiCE is a framework that automates the partitioning and deployment of large CNNs across multiple edge devices, enabling efficient distributed inference with reduced energy and memory use.

Contribution

It introduces a fully automated system for splitting CNNs and generating deployment code for distributed inference on heterogeneous edge devices.

Findings

01

Reduces energy consumption during inference

02

Decreases memory usage on individual devices

03

Improves system throughput for distributed CNN inference

Abstract

Deep Learning approaches based on Convolutional Neural Networks (CNNs) are extensively utilized and very successful in a wide range of application areas, including image classification and speech recognition. For the execution of trained CNNs, i.e. model inference, we nowadays witness a shift from the Cloud to the Edge. Unfortunately, deploying and inferring large, compute and memory intensive CNNs on edge devices is challenging because these devices typically have limited power budgets and compute/memory resources. One approach to address this challenge is to leverage all available resources across multiple edge devices to deploy and execute a large CNN by properly partitioning the CNN and running each CNN partition on a separate edge device. Although such distribution, deployment, and execution of large CNNs on multiple edge devices is a desirable and beneficial approach, there…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

parrotsky/autodice
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Brain Tumor Detection and Classification