P$^2$U: Progressive Precision Update For Efficient Model Distribution
Homayun Afrabandpey, Hamed Rezazadegan Tavakoli

TL;DR
P$^2$U is a method that improves model distribution efficiency by transmitting a low-precision model and a model update, balancing accuracy and bandwidth in resource-constrained environments.
Contribution
The paper introduces P$^2$U, a novel progressive precision update technique that enhances model distribution efficiency and can be combined with existing compression methods.
Findings
P$^2$U achieves better accuracy-bandwidth tradeoffs across various models and datasets.
Aggressive quantization (e.g., 4-bit) can be used without significant performance loss.
P$^2$U is effective for federated learning, edge computing, and IoT deployments.
Abstract
Efficient model distribution is becoming increasingly critical in bandwidth-constrained environments. In this paper, we propose a simple yet effective approach called Progressive Precision Update (PU) to address this problem. Instead of transmitting the original high-precision model, PU transmits a lower-bit precision model, coupled with a model update representing the difference between the original high-precision model and the transmitted low precision version. With extensive experiments on various model architectures, ranging from small models ( million parameters) to a large model (more than million parameters) and using three different data sets, e.g., chest X-Ray, PASCAL-VOC, and CIFAR-100, we demonstrate that PU consistently achieves better tradeoff between accuracy, bandwidth usage and latency. Moreover, we show that when bandwidth or startup time is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Privacy-Preserving Technologies in Data
