Edge Large AI Models: Revolutionizing 6G Networks

Zixin Wang; Yuanming Shi; Yong Zhou; Jingyang Zhu; and Khaled. B.; Letaief

arXiv:2505.00321·cs.NI·May 2, 2025

Edge Large AI Models: Revolutionizing 6G Networks

Zixin Wang, Yuanming Shi, Yong Zhou, Jingyang Zhu, and Khaled. B., Letaief

PDF

TL;DR

This paper explores the deployment of large AI models at the edge for 6G networks, addressing challenges like resource constraints and proposing frameworks for model training, deployment, and applications such as channel prediction.

Contribution

It introduces collaborative fine-tuning, full-parameter training, and microservice-based inference architectures for edge large AI models in 6G, along with application insights for air-interface design.

Findings

01

Proposed frameworks improve edge LAM deployment efficiency.

02

Microservice architecture enhances inference scalability.

03

Application of edge LAM in channel prediction and beamforming.

Abstract

Large artificial intelligence models (LAMs) possess human-like abilities to solve a wide range of real-world problems, exemplifying the potential of experts in various domains and modalities. By leveraging the communication and computation capabilities of geographically dispersed edge devices, edge LAM emerges as an enabling technology to empower the delivery of various real-time intelligent services in 6G. Unlike traditional edge artificial intelligence (AI) that primarily supports a single task using small models, edge LAM is featured by the need of the decomposition and distributed deployment of large models, and the ability to support highly generalized and diverse tasks. However, due to limited communication, computation, and storage resources over wireless networks, the vast number of trainable neurons and the substantial communication overhead pose a formidable hurdle to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.