FedStaleWeight: Buffered Asynchronous Federated Learning with Fair   Aggregation via Staleness Reweighting

Jeffrey Ma; Alan Tu; Yiling Chen; Vijay Janapa Reddi

arXiv:2406.02877·cs.LG·June 6, 2024·1 cites

FedStaleWeight: Buffered Asynchronous Federated Learning with Fair Aggregation via Staleness Reweighting

Jeffrey Ma, Alan Tu, Yiling Chen, Vijay Janapa Reddi

PDF

Open Access 1 Repo

TL;DR

FedStaleWeight introduces a fair aggregation method for asynchronous federated learning that uses staleness-based reweighting to improve fairness and convergence without incentivizing false reporting.

Contribution

The paper proposes FedStaleWeight, a novel algorithm that ensures fair aggregation in asynchronous federated learning by leveraging staleness, with theoretical guarantees and empirical validation.

Findings

01

FedStaleWeight achieves stronger fairness in aggregation.

02

It accelerates convergence to higher model accuracy.

03

The method maintains incentive compatibility for truthful reporting.

Abstract

Federated Learning (FL) endeavors to harness decentralized data while preserving privacy, facing challenges of performance, scalability, and collaboration. Asynchronous Federated Learning (AFL) methods have emerged as promising alternatives to their synchronous counterparts bounded by the slowest agent, yet they add additional challenges in convergence guarantees, fairness with respect to compute heterogeneity, and incorporation of staleness in aggregated updates. Specifically, AFL biases model training heavily towards agents who can produce updates faster, leaving slower agents behind, who often also have differently distributed data which is not learned by the global model. Naively upweighting introduces incentive issues, where true fast updating agents may falsely report updates at a slower speed to increase their contribution to model training. We introduce FedStaleWeight, an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

18jeffreyma/afl-bench
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Cryptography and Data Security

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings