Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices
Huancheng Chen, Haris Vikalo

TL;DR
This paper introduces FedMPQ, a federated learning algorithm that employs mixed-precision quantization to reduce communication costs on heterogeneous devices, achieving better performance than fixed-precision methods with minimal overhead.
Contribution
The paper proposes a novel federated learning algorithm, FedMPQ, that applies mixed-precision quantization with adaptive bit-width assignment to improve efficiency on resource-heterogeneous devices.
Findings
FedMPQ outperforms fixed-precision schemes in various benchmarks.
The method incurs only minor computational overhead.
Effective in both iid and non-iid data settings.
Abstract
While federated learning (FL) systems often utilize quantization to battle communication and computational bottlenecks, they have heretofore been limited to deploying fixed-precision quantization schemes. Meanwhile, the concept of mixed-precision quantization (MPQ), where different layers of a deep learning model are assigned varying bit-width, remains unexplored in the FL settings. We present a novel FL algorithm, FedMPQ, which introduces mixed-precision quantization to resource-heterogeneous FL systems. Specifically, local models, quantized so as to satisfy bit-width constraint, are trained by optimizing an objective function that includes a regularization term which promotes reduction of precision in some of the layers without significant performance degradation. The server collects local model updates, de-quantizes them into full-precision models, and then aggregates them into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
