Federated Learning with Buffered Asynchronous Aggregation
John Nguyen, Kshitiz Malik, Hongyuan Zhan, Ashkan Yousefpour, Michael, Rabbat, Mani Malek, Dzmitry Huba

TL;DR
This paper introduces FedBuff, a buffered asynchronous aggregation method for federated learning that improves scalability and efficiency while maintaining privacy guarantees, outperforming traditional synchronous and asynchronous approaches.
Contribution
FedBuff is a novel aggregation method that combines synchronous and asynchronous FL benefits, compatible with privacy-preserving techniques, and backed by theoretical convergence guarantees.
Findings
FedBuff is 3.3x more efficient than synchronous FL.
FedBuff is up to 2.5x more efficient than asynchronous FL.
FedBuff maintains privacy guarantees with secure aggregation and differential privacy.
Abstract
Scalability and privacy are two critical concerns for cross-device federated learning (FL) systems. In this work, we identify that synchronous FL - synchronized aggregation of client updates in FL - cannot scale efficiently beyond a few hundred clients training in parallel. It leads to diminishing returns in model performance and training speed, analogous to large-batch training. On the other hand, asynchronous aggregation of client updates in FL (i.e., asynchronous FL) alleviates the scalability issue. However, aggregating individual client updates is incompatible with Secure Aggregation, which could result in an undesirable level of privacy for the system. To address these concerns, we propose a novel buffered asynchronous aggregation method, FedBuff, that is agnostic to the choice of optimizer, and combines the best properties of synchronous and asynchronous FL. We empirically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Cryptography and Data Security
