P$^2$U: Progressive Precision Update For Efficient Model Distribution

Homayun Afrabandpey; Hamed Rezazadegan Tavakoli

arXiv:2506.22871·cs.LG·July 1, 2025

TL;DR

P$^2$U is a method that improves model distribution efficiency by transmitting a low-precision model and a model update, balancing accuracy and bandwidth in resource-constrained environments.

Contribution

The paper introduces P$^2$U, a novel progressive precision update technique that enhances model distribution efficiency and can be combined with existing compression methods.

Findings

01. P$^2$U achieves better accuracy-bandwidth tradeoffs across various models and datasets.

02. Aggressive quantization (e.g., 4-bit) can be used without significant performance loss.

03. P$^2$U is effective for federated learning, edge computing, and IoT deployments.

Abstract

Efficient model distribution is becoming increasingly critical in bandwidth-constrained environments. In this paper, we propose a simple yet effective approach called Progressive Precision Update (P$^2$U) to address this problem. Instead of transmitting the original high-precision model, P$^2$U transmits a lower-bit precision model, coupled with a model update representing the difference between the original high-precision model and the transmitted low-precision version. With extensive experiments on various model architectures, ranging from small models (1 to 6 million parameters) to a large model (more than 100 million parameters), and using three different datasets, namely chest X-Ray, PASCAL-VOC, and CIFAR-100, we demonstrate that P$^2$U consistently achieves a better tradeoff between accuracy, bandwidth usage, and latency. Moreover, we show that when bandwidth or startup time is the…
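The core idea in the abstract (send a low-bit model first, then an update equal to the difference from the original) can be illustrated with a minimal NumPy sketch. The uniform min-max quantizer, the function names `quantize_uniform` and `dequantize`, and the random stand-in weights are assumptions made for illustration; the abstract does not specify which quantization scheme or update encoding the authors use.

```python
import numpy as np

def quantize_uniform(weights, num_bits=4):
    """Map float weights to integer codes with a uniform min-max quantizer.

    Assumed scheme for illustration only; not taken from the paper.
    """
    levels = 2 ** num_bits - 1
    w_min = float(weights.min())
    w_max = float(weights.max())
    # Guard against a constant tensor, where the range would be zero.
    scale = (w_max - w_min) / levels if w_max > w_min else 1.0
    codes = np.round((weights - w_min) / scale).astype(np.uint8)
    return codes, scale, w_min

def dequantize(codes, scale, w_min):
    """Recover approximate float weights from the integer codes."""
    return codes.astype(np.float32) * scale + w_min

# Stand-in for one layer's high-precision weights (hypothetical data).
rng = np.random.default_rng(0)
original = rng.normal(size=1000).astype(np.float32)

# Stage 1: the sender transmits the low-bit codes plus (scale, w_min).
codes, scale, w_min = quantize_uniform(original, num_bits=4)
low_precision = dequantize(codes, scale, w_min)

# Stage 2: the sender transmits the update, i.e. the difference between
# the original weights and the low-precision version.
update = original - low_precision

# The receiver adds the update to the low-precision weights it already has.
restored = low_precision + update
max_abs_update = float(np.abs(update).max())
```

In this sketch the receiver can run `low_precision` as soon as stage 1 arrives and apply `update` later. With a uniform quantizer, each entry of the update is bounded by about half a quantization step, which is what makes the residual small relative to the original weights.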

Taxonomy

Topics: Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Privacy-Preserving Technologies in Data