Tackling Intertwined Data and Device Heterogeneities in Federated Learning with Unlimited Staleness
Haoming Wang, Wei Gao

TL;DR
This paper introduces a federated learning framework that effectively handles intertwined data and device heterogeneities with unlimited staleness by estimating data distributions from stale updates, improving accuracy and efficiency without extra overhead.
Contribution
The proposed framework uniquely estimates data distributions from stale updates to convert them into unstale updates, addressing intertwined heterogeneities in federated learning.
Findings
Improves model accuracy by up to 25%.
Reduces training epochs by up to 35%.
Does not require auxiliary datasets or full local models.
Abstract
Federated Learning (FL) can be affected by data and device heterogeneities, caused by clients' different local data distributions and latencies in uploading model updates (i.e., staleness). Traditional schemes consider these heterogeneities as two separate and independent aspects, but this assumption is unrealistic in practical FL scenarios where these heterogeneities are intertwined. In these cases, traditional FL schemes are ineffective, and a better approach is to convert a stale model update into a unstale one. In this paper, we present a new FL framework that ensures the accuracy and computational efficiency of this conversion, hence effectively tackling the intertwined heterogeneities that may cause unlimited staleness in model updates. Our basic idea is to estimate the distributions of clients' local training data from their uploaded stale model updates, and use these estimations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Internet Traffic Analysis and Secure E-voting
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
