Federated Learning with Non-IID Data
Yue Zhao, Meng Li, Liangzhen Lai, Naveen Suda, Damon Civin, Vikas, Chandra

TL;DR
This paper investigates the significant accuracy degradation in federated learning with non-IID data, identifies weight divergence as a key factor, and proposes a small shared data subset to improve model performance.
Contribution
It introduces a novel approach of sharing a small data subset among devices to mitigate non-IID data challenges in federated learning.
Findings
Accuracy drops up to 55% with highly skewed non-IID data.
Shared small data subset improves accuracy by 30%.
Weight divergence explains the accuracy reduction.
Abstract
Federated learning enables resource-constrained edge compute devices, such as mobile phones and IoT devices, to learn a shared model for prediction, while keeping the training data local. This decentralized approach to train models provides privacy, security, regulatory and economic benefits. In this work, we focus on the statistical challenge of federated learning when local data is non-IID. We first show that the accuracy of federated learning reduces significantly, by up to 55% for neural networks trained for highly skewed non-IID data, where each client device trains only on a single class of data. We further show that this accuracy reduction can be explained by the weight divergence, which can be quantified by the earth mover's distance (EMD) between the distribution over classes on each device and the population distribution. As a solution, we propose a strategy to improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Stochastic Gradient Optimization Techniques
