Energy-Efficient AI over a Virtualized Cloud Fog Network
Barzan A. Yosuf, Sanaa H. Mohamed, Mohamed Alenazi, Taisir E. H., El-Gorashi, Jaafar M. H. Elmirghani

TL;DR
This paper proposes an MILP-based method for optimal placement of DNN inference models in a cloud fog network to reduce energy consumption and latency, addressing the limitations of centralized cloud data centers.
Contribution
It introduces a novel formulation of DNN placement as a network embedding problem in a CFN architecture, balancing processing and networking for energy efficiency.
Findings
Significant energy savings compared to baseline CDC deployment
Effective trade-offs between processing and networking in fog nodes
Improved latency performance in edge computing scenarios
Abstract
Deep Neural Networks (DNNs) have served as a catalyst in introducing a plethora of next-generation services in the era of Internet of Things (IoT), thanks to the availability of massive amounts of data collected by the objects on the edge. Currently, DNN models are used to deliver many Artificial Intelligence (AI) services that include image and natural language processing, speech recognition, and robotics. Accordingly, such services utilize various DNN models that make it computationally intensive for deployment on the edge devices alone. Thus, most AI models are offloaded to distant cloud data centers (CDCs), which tend to consolidate large amounts of computing and storage resources into one or more CDCs. Deploying services in the CDC will inevitably lead to excessive latencies and overall increase in power consumption. Instead, fog computing allows for cloud services to be extended…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
