FedStaleWeight: Buffered Asynchronous Federated Learning with Fair Aggregation via Staleness Reweighting
Jeffrey Ma, Alan Tu, Yiling Chen, Vijay Janapa Reddi

TL;DR
FedStaleWeight introduces a fair aggregation method for asynchronous federated learning that uses staleness-based reweighting to improve fairness and convergence without incentivizing false reporting.
Contribution
The paper proposes FedStaleWeight, a novel algorithm that ensures fair aggregation in asynchronous federated learning by leveraging staleness, with theoretical guarantees and empirical validation.
Findings
FedStaleWeight achieves stronger fairness in aggregation.
It accelerates convergence to higher model accuracy.
The method maintains incentive compatibility for truthful reporting.
Abstract
Federated Learning (FL) endeavors to harness decentralized data while preserving privacy, facing challenges of performance, scalability, and collaboration. Asynchronous Federated Learning (AFL) methods have emerged as promising alternatives to their synchronous counterparts bounded by the slowest agent, yet they add additional challenges in convergence guarantees, fairness with respect to compute heterogeneity, and incorporation of staleness in aggregated updates. Specifically, AFL biases model training heavily towards agents who can produce updates faster, leaving slower agents behind, who often also have differently distributed data which is not learned by the global model. Naively upweighting introduces incentive issues, where true fast updating agents may falsely report updates at a slower speed to increase their contribution to model training. We introduce FedStaleWeight, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Cryptography and Data Security
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
