DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning
Robert H\"onig, Yiren Zhao, Robert Mullins

TL;DR
DAdaQuant introduces a doubly-adaptive quantization method for federated learning that dynamically adjusts quantization levels over time and across clients, significantly reducing communication costs while maintaining model quality.
Contribution
The paper proposes a novel doubly-adaptive quantization algorithm for federated learning, combining time and client adaptivity to enhance compression efficiency.
Findings
DAdaQuant outperforms non-adaptive baselines by up to 2.8x in communication compression.
Dynamic adaptation of quantization levels improves federated learning efficiency.
The approach maintains model quality despite increased compression.
Abstract
Federated Learning (FL) is a powerful technique for training a model on a server with data from several clients in a privacy-preserving manner. In FL, a server sends the model to every client, who then train the model locally and send it back to the server. The server aggregates the updated models and repeats the process for several rounds. FL incurs significant communication costs, in particular when transmitting the updated local models from the clients back to the server. Recently proposed algorithms quantize the model parameters to efficiently compress FL communication. These algorithms typically have a quantization level that controls the compression factor. We find that dynamic adaptations of the quantization level can boost compression without sacrificing model quality. First, we introduce a time-adaptive quantization algorithm that increases the quantization level as training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
