Window-based Model Averaging Improves Generalization in Heterogeneous   Federated Learning

Debora Caldarola; Barbara Caputo; Marco Ciccone

arXiv:2310.01366·cs.LG·October 11, 2023·1 cites

Window-based Model Averaging Improves Generalization in Heterogeneous Federated Learning

Debora Caldarola, Barbara Caputo, Marco Ciccone

PDF

Open Access

TL;DR

WIMA enhances federated learning by aggregating models over a window of rounds, improving robustness and stability without extra communication, especially under data heterogeneity and distribution shifts.

Contribution

The paper introduces WIMA, a window-based model averaging method that improves federated learning robustness and generalization without additional communication overhead.

Findings

01

WIMA improves stability under data heterogeneity.

02

WIMA enhances robustness against distribution shifts.

03

WIMA integrates easily with existing algorithms.

Abstract

Federated Learning (FL) aims to learn a global model from distributed users while protecting their privacy. However, when data are distributed heterogeneously the learning process becomes noisy, unstable, and biased towards the last seen clients' data, slowing down convergence. To address these issues and improve the robustness and generalization capabilities of the global model, we propose WIMA (Window-based Model Averaging). WIMA aggregates global models from different rounds using a window-based approach, effectively capturing knowledge from multiple users and reducing the bias from the last ones. By adopting a windowed view on the rounds, WIMA can be applied from the initial stages of training. Importantly, our method introduces no additional communication or client-side computation overhead. Our experiments demonstrate the robustness of WIMA against distribution shifts and bad…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Advanced Graph Neural Networks · Machine Learning in Healthcare