FedHQ: Hybrid Runtime Quantization for Federated Learning

Zihao Zheng; Ziyao Wang; Xiuping Cui; Maoliang Li; Jiayu Chen; Yun (Eric) Liang; Ang Li; Xiang Chen

arXiv:2505.11982·cs.LG·May 20, 2025

FedHQ: Hybrid Runtime Quantization for Federated Learning

Zihao Zheng, Ziyao Wang, Xiuping Cui, Maoliang Li, Jiayu Chen, Yun (Eric) Liang, Ang Li, Xiang Chen

PDF

Open Access

TL;DR

FedHQ introduces a hybrid quantization framework combining PTQ and QAT to enhance federated learning efficiency and accuracy, addressing device and data heterogeneity with adaptive strategy allocation.

Contribution

This paper presents FedHQ, a novel framework that automatically optimizes hybrid quantization strategies for federated learning, balancing speed and accuracy across diverse settings.

Findings

01

Achieves up to 2.47x training acceleration.

02

Improves accuracy by up to 11.15%.

03

Maintains negligible additional overhead.

Abstract

Federated Learning (FL) is a decentralized model training approach that preserves data privacy but struggles with low efficiency. Quantization, a powerful training optimization technique, has been widely explored for integration into FL. However, many studies fail to consider the distinct performance attribution between particular quantization strategies, such as post-training quantization (PTQ) or quantization-aware training (QAT). As a result, existing FL quantization methods rely solely on either PTQ or QAT, optimizing for speed or accuracy while compromising the other. To efficiently accelerate FL and maintain distributed convergence accuracy across various FL settings, this paper proposes a hybrid quantitation approach combining PTQ and QAT for FL systems. We conduct case studies to validate the effectiveness of using hybrid quantization in FL. To solve the difficulty of modeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Big Data and Digital Economy · Cryptography and Data Security

MethodsADaptive gradient method with the OPTimal convergence rate · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings