FedCM: Federated Learning with Client-level Momentum

Jing Xu; Sen Wang; Liwei Wang; Andrew Chi-Chih Yao

arXiv:2106.10874·cs.LG·June 22, 2021·23 cites

FedCM: Federated Learning with Client-level Momentum

Jing Xu, Sen Wang, Liwei Wang, Andrew Chi-Chih Yao

PDF

Open Access 2 Repos

TL;DR

FedCM introduces a novel federated learning algorithm that incorporates client-level momentum to address partial participation and heterogeneity, improving stability and performance in real-world scenarios.

Contribution

The paper proposes FedCM, a new federated learning method that uses client-level momentum to enhance stability and accuracy under diverse client conditions.

Findings

01

FedCM outperforms existing methods in various tasks.

02

FedCM is robust to different client participation levels.

03

FedCM effectively corrects bias caused by client heterogeneity.

Abstract

Federated Learning is a distributed machine learning approach which enables model training without data sharing. In this paper, we propose a new federated learning algorithm, Federated Averaging with Client-level Momentum (FedCM), to tackle problems of partial participation and client heterogeneity in real-world federated learning applications. FedCM aggregates global gradient information in previous communication rounds and modifies client gradient descent with a momentum-like term, which can effectively correct the bias and improve the stability of local SGD. We provide theoretical analysis to highlight the benefits of FedCM. We also perform extensive empirical studies and demonstrate that FedCM achieves superior performance in various tasks and is robust to different levels of client numbers, participation rate and client heterogeneity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques

MethodsStochastic Gradient Descent