FedBN: Federated Learning on Non-IID Features via Local Batch Normalization
Xiaoxiao Li, Meirui Jiang, Xiaofei Zhang, Michael Kamp, Qi Dou

TL;DR
FedBN introduces a local batch normalization technique to improve federated learning performance under feature shift non-iid data distributions, demonstrating faster convergence and superior results over existing methods.
Contribution
This work presents FedBN, a novel federated learning approach that uses local batch normalization to handle feature shift non-iid data, improving convergence and accuracy.
Findings
FedBN outperforms FedAvg and FedProx on non-iid data.
FedBN achieves faster convergence rates.
Empirical results validate the effectiveness of local batch normalization.
Abstract
The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data and hence improving data privacy. In most cases, the assumption of independent and identically distributed samples across local clients does not hold for federated learning setups. Under this setting, neural network training performance may vary significantly according to the data distribution and even hurt training convergence. Most of the previous work has focused on a difference in the distribution of labels or client shifts. Unlike those settings, we address an important problem of FL, e.g., different scanners/sensors in medical imaging, different scenery distribution in autonomous driving (highway vs. city), where local clients store examples with different distributions compared to other clients, which we denote as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
MethodsBatch Normalization
