A review on different techniques used to combat the non-IID and   heterogeneous nature of data in FL

Venkataraman Natarajan Iyer

arXiv:2401.00809·cs.LG·January 2, 2024·5 cites

A review on different techniques used to combat the non-IID and heterogeneous nature of data in FL

Venkataraman Natarajan Iyer

PDF

Open Access

TL;DR

This paper reviews various techniques to address the challenges posed by non-IID and heterogeneous data distributions in Federated Learning, emphasizing the importance of privacy and model convergence.

Contribution

It provides a comprehensive overview of current algorithms designed to mitigate non-IID and heterogeneity issues in Federated Learning.

Findings

01

Identifies key challenges of non-IID data in FL

02

Summarizes existing algorithms addressing data heterogeneity

03

Highlights gaps and future directions in FL research

Abstract

Federated Learning (FL) is a machine-learning approach enabling collaborative model training across multiple decentralized edge devices that hold local data samples, all without exchanging these samples. This collaborative process occurs under the supervision of a central server orchestrating the training or via a peer-to-peer network. The significance of FL is particularly pronounced in industries such as healthcare and finance, where data privacy holds paramount importance. However, training a model under the Federated learning setting brings forth several challenges, with one of the most prominent being the heterogeneity of data distribution among the edge devices. The data is typically non-independently and non-identically distributed (non-IID), thereby presenting challenges to model convergence. This report delves into the issues arising from non-IID and heterogeneous data and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Traffic Prediction and Management Techniques