Efficient Multiuser AI Downloading via Reusable Knowledge Broadcasting

Hai Wu; Qunsong Zeng; and Kaibin Huang

arXiv:2307.15316·cs.IT·July 31, 2023

Efficient Multiuser AI Downloading via Reusable Knowledge Broadcasting

Hai Wu, Qunsong Zeng, and Kaibin Huang

PDF

Open Access

TL;DR

This paper introduces the MBA framework for multiuser AI model downloading in 6G networks, significantly reducing latency by leveraging reusable shared knowledge and optimized broadcasting strategies.

Contribution

It presents the first reusable knowledge-based broadcasting framework with efficient algorithms for parameter selection and power control, enhancing multiuser model downloading.

Findings

01

Substantial latency reduction demonstrated in experiments.

02

Efficient algorithms for parameter selection and power control.

03

Framework guarantees device performance while minimizing download time.

Abstract

For the 6G mobile networks, in-situ model downloading has emerged as an important use case to enable real-time adaptive artificial intelligence on edge devices. However, the simultaneous downloading of diverse and high-dimensional models to multiple devices over wireless links presents a significant communication bottleneck. To overcome the bottleneck, we propose the framework of model broadcasting and assembling (MBA), which represents the first attempt on leveraging reusable knowledge, referring to shared parameters among tasks, to enable parameter broadcasting to reduce communication overhead. The MBA framework comprises two key components. The first, the MBA protocol, defines the system operations including parameter selection from a model library, power control for broadcasting, and model assembling at devices. The second component is the joint design of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCaching and Content Delivery · IoT and Edge/Fog Computing · Age of Information Optimization