An Overview on Generative AI at Scale with Edge-Cloud Computing

Yun-Cheng Wang; Jintang Xue; Chengwei Wei; C.-C. Jay Kuo

arXiv:2306.17170·cs.DC·July 11, 2023·2 cites

An Overview on Generative AI at Scale with Edge-Cloud Computing

Yun-Cheng Wang, Jintang Xue, Chengwei Wei, C.-C. Jay Kuo

PDF

Open Access

TL;DR

This paper reviews recent advances in generative AI and edge-cloud computing, discussing technical challenges and design considerations for scaling GenAI systems efficiently at the edge and cloud interface.

Contribution

It provides a comprehensive overview of integrating GenAI with edge-cloud computing, highlighting challenges and proposing future research directions.

Findings

01

Edge-cloud collaboration reduces latency for GenAI services.

02

Technical challenges include data management and system scalability.

03

Design considerations improve training and deployment at scale.

Abstract

As a specific category of artificial intelligence (AI), generative artificial intelligence (GenAI) generates new content that resembles what is created by humans. The rapid development of GenAI systems has created a huge amount of new data on the Internet, posing new challenges to current computing and communication frameworks. Currently, GenAI services rely on the traditional cloud computing framework due to the need for large computation resources. However, such services will encounter high latency because of data transmission and a high volume of requests. On the other hand, edge-cloud computing can provide adequate computation power and low latency at the same time through the collaboration between edges and the cloud. Thus, it is attractive to build GenAI systems at scale by leveraging the edge-cloud computing paradigm. In this overview paper, we review recent developments in GenAI…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Robotics and Automated Systems