Toward Resource-Efficient Collaboration of Large AI Models in Mobile Edge Networks
Peichun Li, Liping Qian, Dusit Niyato, Shiwen Mao, Yuan Wu

TL;DR
This paper reviews and advances methods for resource-efficient collaboration of large AI models in mobile edge networks, enabling effective deployment despite limited resources, with a new multi-stage diffusion framework demonstrating improved efficiency and adaptability.
Contribution
It provides a comprehensive overview of collaborative AI system architectures and introduces a novel multi-stage diffusion framework for elastic distribution of large models at the edge.
Findings
The multi-stage diffusion framework improves efficiency and adaptability.
Experimental results show enhanced data generation performance.
Various spatial and temporal collaboration techniques are discussed.
Abstract
The collaboration of large artificial intelligence (AI) models in mobile edge networks has emerged as a promising paradigm to meet the growing demand for intelligent services at the network edge. By enabling multiple devices to cooperatively execute submodels or subtasks, collaborative AI enhances inference efficiency and service quality with constrained resources. However, deploying large AI models in such environments remains challenging due to the intrinsic mismatch between model complexity and the limited computation, memory, and communication resources in edge networks. This article provides a comprehensive overview of the system architecture for collaborative AI in mobile edge networks, along with representative application scenarios in transportation and healthcare. We further present recent advances in resource-efficient collaboration techniques, categorized into spatial and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpportunistic and Delay-Tolerant Networks · Advanced Data and IoT Technologies · Vehicular Ad Hoc Networks (VANETs)
