AutoDiCE: Fully Automated Distributed CNN Inference at the Edge
Xiaotian Guo, Andy D.Pimentel, Todor Stefanov

TL;DR
AutoDiCE is a framework that automates the partitioning and deployment of large CNNs across multiple edge devices, enabling efficient distributed inference with reduced energy and memory use.
Contribution
It introduces a fully automated system for splitting CNNs and generating deployment code for distributed inference on heterogeneous edge devices.
Findings
Reduces energy consumption during inference
Decreases memory usage on individual devices
Improves system throughput for distributed CNN inference
Abstract
Deep Learning approaches based on Convolutional Neural Networks (CNNs) are extensively utilized and very successful in a wide range of application areas, including image classification and speech recognition. For the execution of trained CNNs, i.e. model inference, we nowadays witness a shift from the Cloud to the Edge. Unfortunately, deploying and inferring large, compute and memory intensive CNNs on edge devices is challenging because these devices typically have limited power budgets and compute/memory resources. One approach to address this challenge is to leverage all available resources across multiple edge devices to deploy and execute a large CNN by properly partitioning the CNN and running each CNN partition on a separate edge device. Although such distribution, deployment, and execution of large CNNs on multiple edge devices is a desirable and beneficial approach, there…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Brain Tumor Detection and Classification
