The Effects of Partitioning Strategies on Energy Consumption in Distributed CNN Inference at The Edge
Erqian Tang, Xiaotian Guo, Todor Stefanov

TL;DR
This paper investigates how four different CNN partitioning strategies impact energy consumption on edge devices during distributed inference, aiming to identify the most energy-efficient approach for resource-constrained environments.
Contribution
It provides the first comparative analysis of four CNN partitioning strategies' effects on energy consumption at the edge, guiding energy-efficient distributed inference.
Findings
Certain partitioning strategies significantly reduce energy per device.
Partitioning strategies' effectiveness varies with network size and device capabilities.
The study offers insights into optimizing energy consumption for edge AI deployments.
Abstract
Nowadays, many AI applications utilizing resource-constrained edge devices (e.g., small mobile robots, tiny IoT devices, etc.) require Convolutional Neural Network (CNN) inference on a distributed system at the edge due to limited resources of a single edge device to accommodate and execute a large CNN. There are four main partitioning strategies that can be utilized to partition a large CNN model and perform distributed CNN inference on multiple devices at the edge. However, to the best of our knowledge, no research has been conducted to investigate how these four partitioning strategies affect the energy consumption per edge device. Such an investigation is important because it will reveal the potential of these partitioning strategies to be used effectively for reduction of the per-device energy consumption when a large CNN model is deployed for distributed inference at the edge.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Advanced Memory and Neural Computing · Advanced Neural Network Applications
