Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

Ziqing Fan; Shengchao Hu; Jiangchao Yao; Gang Niu; Ya Zhang; Masashi Sugiyama; Yanfeng Wang

arXiv:2405.18890·cs.LG·May 30, 2024·1 cites

Open Access 1 Repo

TL;DR

FedLESAM introduces a novel method for federated sharpness-aware minimization by estimating global perturbations locally, improving model performance and efficiency in heterogeneous federated learning environments.

Contribution

The paper proposes FedLESAM, a new algorithm that estimates global perturbations locally, enhancing federated sharpness-aware minimization with theoretical and empirical validation.

Findings

01 FedLESAM outperforms existing methods on benchmark datasets.

02 It speeds up training by reducing backpropagation steps.

03 It achieves tighter theoretical bounds on perturbation consistency.

Abstract

In federated learning (FL), multi-step local updates and data heterogeneity among clients often lead to a loss landscape with sharper minima, degrading the performance of the resulting global model. Prevalent federated approaches incorporate sharpness-aware minimization (SAM) into local training to mitigate this problem. However, the local loss landscapes may not accurately reflect the flatness of the global loss landscape in heterogeneous environments; as a result, minimizing local sharpness and calculating perturbations on client data might not bring the efficacy of SAM in FL in line with that in centralized training. To overcome this challenge, we propose FedLESAM, a novel algorithm that locally estimates the direction of the global perturbation on the client side as the difference between the global models received in the previous active round and the current round. Besides the improved quality, FedLESAM also speeds up…
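The estimation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the official repository for that); the function names, the radius `rho`, the learning rate, the toy quadratic loss, and the sign convention (previous model minus current model, scaled to norm `rho`) are all assumptions made for illustration.

```python
import math

def estimate_global_perturbation(prev_global, curr_global, rho=0.05, eps=1e-12):
    """Hypothetical helper: use the difference between the global model from the
    previous active round and the current global model as the perturbation
    direction, rescaled to length rho (sign convention is an assumption)."""
    diff = [p - c for p, c in zip(prev_global, curr_global)]
    norm = math.sqrt(sum(d * d for d in diff))
    return [rho * d / (norm + eps) for d in diff]

def local_step(weights, perturbation, grad_fn, lr=0.1):
    """One SAM-style local step: evaluate the gradient at the perturbed point,
    then apply it to the unperturbed weights."""
    perturbed = [w + e for w, e in zip(weights, perturbation)]
    grad = grad_fn(perturbed)  # the only gradient evaluation in this step
    return [w - lr * g for w, g in zip(weights, grad)]

def quadratic_grad(w):
    """Gradient of the toy loss 0.5 * ||w||^2 (illustrative only)."""
    return list(w)

prev_global = [1.0, 2.0]   # global model received in the previous active round
curr_global = [0.7, 1.6]   # global model received in the current round
delta = estimate_global_perturbation(prev_global, curr_global, rho=0.05)
new_weights = local_step(curr_global, delta, quadratic_grad, lr=0.1)
```

In this sketch the perturbation comes from two stored model snapshots, so each local step needs a single gradient evaluation. Standard SAM, by contrast, computes one gradient to build the perturbation and a second one at the perturbed point.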

Code & Models

Repositories

MediaBrain-SJTU/FedLESAM
PyTorch · Official

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Privacy-Preserving Technologies in Data

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sharpness-Aware Minimization · ALIGN · Segment Anything Model