Parametric-Sensitivity Aware Retransmission for Efficient AI Downloading
You Zhou, Qunsong Zeng, Kaibin Huang

TL;DR
This paper introduces PASAR, a retransmission framework that prioritizes important model parameters based on sensitivity, significantly improving AI model downloading efficiency over wireless channels.
Contribution
The paper presents a novel sensitivity-aware retransmission protocol that adaptively manages packet retransmissions based on parametric importance, reducing latency and improving efficiency.
Findings
PASAR outperforms classical HARQ schemes in efficiency and latency.
Most model parameters have low sensitivity, allowing selective retransmission.
Adaptive retransmission reduces communication overhead while maintaining model accuracy.
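To make the selective-retransmission idea in the findings concrete, here is a minimal sketch. It is not the PASAR protocol itself, whose termination rule is not given in this summary. It assumes a simple erasure channel and a hypothetical two-level rule: packets whose sensitivity meets a threshold get several attempts, and all others get one. The names `retransmission_budget`, `download`, `threshold`, and `max_rounds` are illustrative assumptions.

```python
import random


def retransmission_budget(sensitivity, threshold, max_rounds):
    """Hypothetical rule (an assumption, not from the paper): packets at or
    above the sensitivity threshold may be sent up to max_rounds times;
    all other packets are sent once."""
    return max_rounds if sensitivity >= threshold else 1


def download(sensitivities, erasure_prob, threshold, max_rounds, seed=0):
    """Simulate sending one packet per sensitivity value over an erasure
    channel. Returns (delivered flags, total number of transmissions)."""
    rng = random.Random(seed)
    delivered = []
    total_tx = 0
    for s in sensitivities:
        ok = False
        for _ in range(retransmission_budget(s, threshold, max_rounds)):
            total_tx += 1
            # The attempt succeeds unless the channel erases the packet.
            if rng.random() >= erasure_prob:
                ok = True
                break
        delivered.append(ok)
    return delivered, total_tx


if __name__ == "__main__":
    # Right-skewed toy sensitivities: most values small, a few large.
    rng = random.Random(1)
    sens = [rng.lognormvariate(0.0, 1.5) for _ in range(1000)]
    flags, tx = download(sens, erasure_prob=0.3, threshold=5.0, max_rounds=4)
    print(f"delivered {sum(flags)} of {len(flags)} packets in {tx} transmissions")
```

Under this toy rule, the transmission count stays close to one per packet when few parameters exceed the threshold, which is the intuition behind exploiting a right-skewed sensitivity distribution.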
Abstract
Edge artificial intelligence (AI) applications in next-generation mobile networks demand efficient AI-model downloading techniques to support real-time, on-device inference. However, transmitting high-dimensional AI models over wireless channels remains challenging due to limited communication resources. To address this issue, we propose a parametric-sensitivity-aware retransmission (PASAR) framework that manages the radio-resource usage of different parameter packets according to their impact on model inference accuracy, known as parametric sensitivity. Empirical analysis reveals a highly right-skewed sensitivity distribution, indicating that only a small fraction of parameters significantly affect model performance. Leveraging this insight, we design a novel online retransmission protocol, i.e., the PASAR protocol, that adaptively terminates packet transmission based on real-time…
Taxonomy
Topics: Age of Information Optimization · IoT Networks and Protocols · Wireless Signal Modulation Classification
