Faster On-Device Training Using New Federated Momentum Algorithm
Zhouyuan Huo, Qian Yang, Bin Gu, Lawrence Carin. Heng Huang

TL;DR
This paper introduces a new federated learning algorithm with momentum that guarantees convergence for non-convex problems and accelerates training, demonstrated through experiments on benchmark datasets.
Contribution
It presents a novel accelerated federated learning algorithm with proven convergence guarantees for non-convex problems, advancing on existing methods.
Findings
The new algorithm converges faster than previous approaches.
Federated averaging is proven to converge for non-convex problems.
Experimental results validate the improved convergence speed.
Abstract
Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices continuously generate a significant quantity of data, which provide tremendous opportunities to develop innovative intelligent applications. To utilize these data to train machine learning models while not compromising user privacy, federated learning has become a promising solution. However, there is little understanding of whether federated learning algorithms are guaranteed to converge. We reconsider model averaging in federated learning and formulate it as a gradient-based method with biased gradients. This novel perspective assists analysis of its convergence rate and provides a new direction for more acceleration. We prove for the first time that the federated averaging algorithm is guaranteed to converge for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Stochastic Gradient Optimization Techniques
