Federated Learning with Non-IID Data

Yue Zhao; Meng Li; Liangzhen Lai; Naveen Suda; Damon Civin; Vikas; Chandra

arXiv:1806.00582·cs.LG·July 22, 2022·1.9k cites

Federated Learning with Non-IID Data

Yue Zhao, Meng Li, Liangzhen Lai, Naveen Suda, Damon Civin, Vikas, Chandra

PDF

Open Access 2 Repos

TL;DR

This paper investigates the significant accuracy degradation in federated learning with non-IID data, identifies weight divergence as a key factor, and proposes a small shared data subset to improve model performance.

Contribution

It introduces a novel approach of sharing a small data subset among devices to mitigate non-IID data challenges in federated learning.

Findings

01

Accuracy drops up to 55% with highly skewed non-IID data.

02

Shared small data subset improves accuracy by 30%.

03

Weight divergence explains the accuracy reduction.

Abstract

Federated learning enables resource-constrained edge compute devices, such as mobile phones and IoT devices, to learn a shared model for prediction, while keeping the training data local. This decentralized approach to train models provides privacy, security, regulatory and economic benefits. In this work, we focus on the statistical challenge of federated learning when local data is non-IID. We first show that the accuracy of federated learning reduces significantly, by up to 55% for neural networks trained for highly skewed non-IID data, where each client device trains only on a single class of data. We further show that this accuracy reduction can be explained by the weight divergence, which can be quantified by the earth mover's distance (EMD) between the distribution over classes on each device and the population distribution. As a solution, we propose a strategy to improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Stochastic Gradient Optimization Techniques