An Overview on Generative AI at Scale with Edge-Cloud Computing
Yun-Cheng Wang, Jintang Xue, Chengwei Wei, C.-C. Jay Kuo

TL;DR
This paper reviews recent advances in generative AI and edge-cloud computing, discussing technical challenges and design considerations for scaling GenAI systems efficiently at the edge and cloud interface.
Contribution
It provides a comprehensive overview of integrating GenAI with edge-cloud computing, highlighting challenges and proposing future research directions.
Findings
Edge-cloud collaboration reduces latency for GenAI services.
Technical challenges include data management and system scalability.
Design considerations improve training and deployment at scale.
Abstract
As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing framework due to the need for large computation resources. However, such services will encounter high latency because of data transmission and a high volume of requests. On the other hand, edge-cloud computing can provide adequate computation power and low latency at the same time through the collaboration between edges and the cloud. Thus, it is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this overview paper, we review recent developments in GenAI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Robotics and Automated Systems
