Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware   Minimization for Federated Learning

Debora Caldarola; Pietro Cagnasso; Barbara Caputo; Marco Ciccone

arXiv:2412.03752·cs.AI·April 1, 2025

Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware Minimization for Federated Learning

Debora Caldarola, Pietro Cagnasso, Barbara Caputo, Marco Ciccone

PDF

Open Access 2 Repos

TL;DR

FedGloSS is a federated learning method that optimizes global sharpness on the server using a communication-efficient approximation, leading to flatter minima and improved model performance.

Contribution

Introduces FedGloSS, a novel server-side sharpness-aware minimization approach that reduces communication costs while enhancing global model flatness in federated learning.

Findings

01

FedGloSS achieves flatter minima than existing methods.

02

It outperforms state-of-the-art FL approaches on vision benchmarks.

03

The method reduces communication overhead significantly.

Abstract

Federated learning (FL) enables collaborative model training with privacy preservation. Data heterogeneity across edge devices (clients) can cause models to converge to sharp minima, negatively impacting generalization and robustness. Recent approaches use client-side sharpness-aware minimization (SAM) to encourage flatter minima, but the discrepancy between local and global loss landscapes often undermines their effectiveness, as optimizing for local sharpness does not ensure global flatness. This work introduces FedGloSS (Federated Global Server-side Sharpness), a novel FL approach that prioritizes the optimization of global sharpness on the server, using SAM. To reduce communication overhead, FedGloSS cleverly approximates sharpness using the previous global gradient, eliminating the need for additional client communication. Our extensive evaluations demonstrate that FedGloSS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks

MethodsSharpness-Aware Minimization · Segment Anything Model