Generalized Federated Learning via Sharpness Aware Minimization
Zhe Qu, Xingyu Li, Rui Duan, Yao Liu, Bo Tang, and Zhuo Lu

TL;DR
This paper introduces FedSAM and MoFedSAM, novel federated learning algorithms utilizing Sharpness Aware Minimization to improve model generalization and reduce deviation caused by data heterogeneity.
Contribution
It proposes a new FL optimization approach with SAM, providing convergence analysis and generalization bounds, outperforming existing methods.
Findings
FedSAM outperforms existing FL algorithms in experiments.
MoFedSAM effectively bridges local and global models.
Both algorithms reduce deviation caused by data heterogeneity.
Abstract
Federated Learning (FL) is a promising framework for performing privacy-preserving, distributed learning with a set of clients. However, the data distribution among clients often exhibits non-IID, i.e., distribution shift, which makes efficient optimization difficult. To tackle this problem, many FL algorithms focus on mitigating the effects of data heterogeneity across clients by increasing the performance of the global model. However, almost all algorithms leverage Empirical Risk Minimization (ERM) to be the local optimizer, which is easy to make the global model fall into a sharp valley and increase a large deviation of parts of local clients. Therefore, in this paper, we revisit the solutions to the distribution shift problem in FL with a focus on local learning generality. To this end, we propose a general, effective algorithm, \texttt{FedSAM}, based on Sharpness Aware Minimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks
MethodsAttentive Walk-Aggregating Graph Neural Network
