Enhanced AI as a Service at the Edge via Transformer Network
Vahid Pourakbar, Hamed Shah-Mansouri

TL;DR
This paper introduces an efficient offloading mechanism using transformer networks for AI as a service at the edge, significantly reducing energy consumption and task failures in resource-constrained environments.
Contribution
It proposes a novel transformer-based model for optimizing task offloading in edge AI, improving energy efficiency and reliability over existing methods.
Findings
18% reduction in energy consumption with limited resources
Significant decrease in task failure rates
Enhanced energy efficiency over baseline schemes
Abstract
Artificial intelligence (AI) has become a pivotal force in reshaping next generation mobile networks. Edge computing holds promise in enabling AI as a service (AIaaS) for prompt decision-making by offloading deep neural network (DNN) inference tasks to the edge. However, current methodologies exhibit limitations in efficiently offloading the tasks, leading to possible resource underutilization and waste of mobile devices' energy. To tackle these issues, in this paper, we study AIaaS at the edge and propose an efficient offloading mechanism for renowned DNN architectures like ResNet and VGG16. We model the inference tasks as directed acyclic graphs and formulate a problem that aims to minimize the devices' energy consumption while adhering to their latency requirements and accounting for servers' capacity. To effectively solve this problem, we utilize a transformer DNN architecture. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing
