Overcoming the Challenges of Batch Normalization in Federated Learning
Rachid Guerraoui, Rafael Pinot, Geovani Rizk, John Stephan,, Fran\c{c}ois Taiani

TL;DR
This paper introduces Federated BatchNorm (FBN), a novel method that adapts batch normalization for federated learning, effectively addressing data heterogeneity and covariate shift to improve model performance.
Contribution
The paper proposes FBN, a new scheme that maintains batch normalization benefits in federated learning by aligning training and centralized statistics, even under high data heterogeneity.
Findings
FBN reduces covariate shift in federated learning.
FBN achieves performance comparable to centralized training.
Enhanced FBN can mitigate erroneous statistics and adversarial attacks.
Abstract
Batch normalization has proven to be a very beneficial mechanism to accelerate the training and improve the accuracy of deep neural networks in centralized environments. Yet, the scheme faces significant challenges in federated learning, especially under high data heterogeneity. Essentially, the main challenges arise from external covariate shifts and inconsistent statistics across clients. We introduce in this paper Federated BatchNorm (FBN), a novel scheme that restores the benefits of batch normalization in federated learning. Essentially, FBN ensures that the batch normalization during training is consistent with what would be achieved in a centralized execution, hence preserving the distribution of the data, and providing running statistics that accurately approximate the global statistics. FBN thereby reduces the external covariate shift and matches the evaluation performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Privacy-Preserving Technologies in Data · Machine Learning and Data Classification
MethodsBatch Normalization
