FedVSSAM: Mitigating Flatness Incompatibility in Sharpness-Aware Federated Learning

Bingnan Xiao; Yuan Gao; Bingcong Li; Wei Ni; Xin Wang; Tony Q. S. Quek

arXiv:2605.09144·cs.LG·May 12, 2026

FedVSSAM: Mitigating Flatness Incompatibility in Sharpness-Aware Federated Learning

Bingnan Xiao, Yuan Gao, Bingcong Li, Wei Ni, Xin Wang, Tony Q. S. Quek

PDF

TL;DR

FedVSSAM introduces a novel approach to address flatness incompatibility in federated learning, improving global model generalization under data heterogeneity by stabilizing local updates.

Contribution

The paper proposes FedVSSAM, a variance-suppressed sharpness-aware method that aligns local and global directions to mitigate flatness incompatibility in federated learning.

Findings

01

FedVSSAM effectively mitigates flatness incompatibility.

02

It outperforms baseline methods in diverse FL settings.

03

Theoretical convergence guarantees are established.

Abstract

Sharpness-aware minimization (SAM) is an effective method for improving the generalization of federated learning (FL) by steering local training toward flat minima. Under data heterogeneity, however, device-side SAM searches for locally flat basins that are incompatible with the flat region preferred by the global objective. We identify this structural failure mode as flatness incompatibility, which explains why improving local flatness alone may provide limited training and generalization improvement for the global model. We reveal that flatness incompatibility arises from data heterogeneity and the friendly adversary phenomenon, and is further amplified by local updates and partial device participation. To mitigate this issue, we propose Federated Learning with variance-suppressed sharpness-aware minimization (FedVSSAM), which constructs a variance-suppressed adjusted direction and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.