Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning
Hao Zhou, Chengming Hu, Dun Yuan, Ye Yuan, Di Wu, Xue Liu, Zhu Han,, and Charlie Zhang

TL;DR
This paper proposes a novel edge-cloud deployment of foundation models in 6G networks, using in-context learning for task offloading to minimize service delay without requiring model training or fine-tuning.
Contribution
It introduces a new in-context learning based method for task offloading in 6G edge-cloud networks, optimizing content generation delay without model training.
Findings
Effective reduction in service delay demonstrated through simulations
In-context learning enables task offloading without model fine-tuning
Proposed system maintains high generation quality in 6G edge-cloud environments
Abstract
Generative artificial intelligence (GAI) is a promising technique towards 6G networks, and generative foundation models such as large language models (LLMs) have attracted considerable interest from academia and telecom industry. This work considers a novel edge-cloud deployment of foundation models in 6G networks. Specifically, it aims to minimize the service delay of foundation models by radio resource allocation and task offloading, i.e., offloading diverse content generation tasks to proper LLMs at the network edge or cloud. In particular, we first introduce the communication system model, i.e., allocating radio resources and calculating link capacity to support generated content transmission, and then we present the LLM inference model to calculate the delay of content generation. After that, we propose a novel in-context learning method to optimize the task offloading decisions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Age of Information Optimization · Privacy-Preserving Technologies in Data
Methodstravel james
