Efficient Multiuser AI Downloading via Reusable Knowledge Broadcasting
Hai Wu, Qunsong Zeng, and Kaibin Huang

TL;DR
This paper introduces the MBA framework for multiuser AI model downloading in 6G networks, significantly reducing latency by leveraging reusable shared knowledge and optimized broadcasting strategies.
Contribution
It presents the first reusable knowledge-based broadcasting framework with efficient algorithms for parameter selection and power control, enhancing multiuser model downloading.
Findings
Substantial latency reduction demonstrated in experiments.
Efficient algorithms for parameter selection and power control.
Framework guarantees device performance while minimizing download time.
Abstract
For the 6G mobile networks, in-situ model downloading has emerged as an important use case to enable real-time adaptive artificial intelligence on edge devices. However, the simultaneous downloading of diverse and high-dimensional models to multiple devices over wireless links presents a significant communication bottleneck. To overcome the bottleneck, we propose the framework of model broadcasting and assembling (MBA), which represents the first attempt on leveraging reusable knowledge, referring to shared parameters among tasks, to enable parameter broadcasting to reduce communication overhead. The MBA framework comprises two key components. The first, the MBA protocol, defines the system operations including parameter selection from a model library, power control for broadcasting, and model assembling at devices. The second component is the joint design of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · IoT and Edge/Fog Computing · Age of Information Optimization
