FedHQ: Hybrid Runtime Quantization for Federated Learning
Zihao Zheng, Ziyao Wang, Xiuping Cui, Maoliang Li, Jiayu Chen, Yun (Eric) Liang, Ang Li, Xiang Chen

TL;DR
FedHQ introduces a hybrid quantization framework combining PTQ and QAT to enhance federated learning efficiency and accuracy, addressing device and data heterogeneity with adaptive strategy allocation.
Contribution
This paper presents FedHQ, a novel framework that automatically optimizes hybrid quantization strategies for federated learning, balancing speed and accuracy across diverse settings.
Findings
Achieves up to 2.47x training acceleration.
Improves accuracy by up to 11.15%.
Maintains negligible additional overhead.
Abstract
Federated Learning (FL) is a decentralized model training approach that preserves data privacy but struggles with low efficiency. Quantization, a powerful training optimization technique, has been widely explored for integration into FL. However, many studies fail to consider the distinct performance attribution between particular quantization strategies, such as post-training quantization (PTQ) or quantization-aware training (QAT). As a result, existing FL quantization methods rely solely on either PTQ or QAT, optimizing for speed or accuracy while compromising the other. To efficiently accelerate FL and maintain distributed convergence accuracy across various FL settings, this paper proposes a hybrid quantitation approach combining PTQ and QAT for FL systems. We conduct case studies to validate the effectiveness of using hybrid quantization in FL. To solve the difficulty of modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Big Data and Digital Economy · Cryptography and Data Security
MethodsADaptive gradient method with the OPTimal convergence rate · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
