Accelerating Federated Learning with a Global Biased Optimiser
Jed Mills, Jia Hu, Geyong Min, Rui Jin, Siwei Zheng, Jin Wang

TL;DR
This paper introduces FedGBO, a novel federated learning optimizer that accelerates training and improves convergence on non-IID data by using global biased optimizer values, with theoretical convergence guarantees and extensive empirical validation.
Contribution
FedGBO is a new adaptive optimizer for federated learning that reduces client drift and accelerates convergence on non-IID data, with proven theoretical convergence and superior empirical performance.
Findings
FedGBO outperforms existing FL algorithms on multiple benchmarks.
FedGBO reduces data upload and computational costs.
Theoretical convergence of FedGBO on nonconvex objectives is established.
Abstract
Federated Learning (FL) is a recent development in distributed machine learning that collaboratively trains models without training data leaving client devices, preserving data privacy. In real-world FL, the training set is distributed over clients in a highly non-Independent and Identically Distributed (non-IID) fashion, harming model convergence speed and final performance. To address this challenge, we propose a novel, generalised approach for incorporating adaptive optimisation into FL with the Federated Global Biased Optimiser (FedGBO) algorithm. FedGBO accelerates FL by employing a set of global biased optimiser values during training, reducing 'client-drift' from non-IID data whilst benefiting from adaptive optimisation. We show that in FedGBO, updates to the global model can be reformulated as centralised training using biased gradients and optimiser updates, and apply this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
