FedEve: On Bridging the Client Drift and Period Drift for Cross-device Federated Learning
Tao Shen, Zexi Li, Didi Zhu, Ziyu Zhao, Chao Wu, Fei Wu

TL;DR
This paper introduces FedEve, a novel method for cross-device federated learning that addresses both client and period drift, improving model convergence and performance on heterogeneous data.
Contribution
FedEve is the first approach to jointly mitigate client drift and period drift in cross-device FL, enhancing robustness against data heterogeneity.
Findings
FedEve reduces model update variance.
Outperforms existing methods on non-iid data.
Effectively mitigates the impact of period and client drift.
Abstract
Federated learning (FL) is a machine learning paradigm that allows multiple clients to collaboratively train a shared model without exposing their private data. Data heterogeneity is a fundamental challenge in FL, which can result in poor convergence and performance degradation. Client drift has been recognized as one of the factors contributing to this issue resulting from the multiple local updates in FedAvg. However, in cross-device FL, a different form of drift arises due to the partial client participation, but it has not been studied well. This drift, we referred as period drift, occurs as participating clients at each communication round may exhibit distinct data distribution that deviates from that of all clients. It could be more harmful than client drift since the optimization objective shifts with every round. In this paper, we investigate the interaction between period…
Peer Reviews
Decision·ICLR 2024 Conference Withdrawn Submission
Refer to summary
Refer to summary
1. Using Bayesian filter to compensate two sources of drift is novel. 2. The connection between server momentum and Kalman Filter is interesting. 3. The paper is written clearly.
1. The so called ``period drift'' comes from the stochastic sampling of clients. If we see sampling clients as sampling data in SGD, such a period drift also happens during SGD -- each batch of data has distinct data distribution from other batches. Authors should provide a more rigorous definition of period drift and show that how the period drift harms training. 2. The Figure 3 shows the period drift that the sampled data on one client varies across different rounds. This may still be similar
- This work is well organized and written. - This work proposes a very simple and effective method based on the Bayesian filter (or Kalman filter). The experimental results support their claim. - The proposed "period drift" concept is good for federatede learning community to further study. Authors are encouraged to open-source their source codes for FedEvE and other compared methods which helps to broaden the influence of this work. - Personally I like the analysis of Kalman Gain a lot : ) - O
- Since this work is tightly related to the client selection, so the random seeds to conduct experiments on their proposed method and baseline methods should be given to increase the reproducibility. - The total client number for other datasets (CIFAR10/100) seems not given.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Advanced Data Storage Technologies · Internet Traffic Analysis and Secure E-voting
