FedStein: Enhancing Multi-Domain Federated Learning Through James-Stein Estimator
Sunny Gupta, Nikita Jangid, Amit Sethi

TL;DR
FedStein introduces a novel federated learning method that leverages the James-Stein estimator to improve multi-domain data generalization by sharing only specific batch normalization statistics, leading to significant accuracy gains.
Contribution
This paper proposes FedStein, a new federated learning approach that enhances multi-domain performance by sharing James-Stein estimates of BN statistics, addressing domain heterogeneity.
Findings
FedStein outperforms FedAvg and FedBN in accuracy.
Achieves over 14% accuracy improvement in some domains.
Enhances domain generalization across multiple datasets.
Abstract
Federated Learning (FL) facilitates data privacy by enabling collaborative in-situ training across decentralized clients. Despite its inherent advantages, FL faces significant challenges of performance and convergence when dealing with data that is not independently and identically distributed (non-i.i.d.). While previous research has primarily addressed the issue of skewed label distribution across clients, this study focuses on the less explored challenge of multi-domain FL, where client data originates from distinct domains with varying feature distributions. We introduce a novel method designed to address these challenges FedStein: Enhancing Multi-Domain Federated Learning Through the James-Stein Estimator. FedStein uniquely shares only the James-Stein (JS) estimates of batch normalization (BN) statistics across clients, while maintaining local BN parameters. The non-BN layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
MethodsBatch Normalization
