Clustered Data Sharing for Non-IID Federated Learning over Wireless   Networks

Gang Hu; Yinglei Teng; Nan Wang; F. Richard Yu

arXiv:2302.10747·cs.LG·December 20, 2023

Clustered Data Sharing for Non-IID Federated Learning over Wireless Networks

Gang Hu, Yinglei Teng, Nan Wang, F. Richard Yu

PDF

Open Access

TL;DR

This paper introduces a clustered data sharing framework for federated learning over wireless networks, improving model accuracy and convergence on non-IID data through device-to-device communication and adaptive clustering.

Contribution

It proposes a novel privacy-preserving clustered data sharing method with an adaptive clustering algorithm tailored for non-IID federated learning scenarios.

Findings

01

Enhanced convergence of FL on non-IID data

02

Improved model accuracy with limited communication

03

Effective data sharing via device-to-device communication

Abstract

Federated Learning (FL) is a novel distributed machine learning approach to leverage data from Internet of Things (IoT) devices while maintaining data privacy. However, the current FL algorithms face the challenges of non-independent and identically distributed (non-IID) data, which causes high communication costs and model accuracy declines. To address the statistical imbalances in FL, we propose a clustered data sharing framework which spares the partial data from cluster heads to credible associates through device-to-device (D2D) communication. Moreover, aiming at diluting the data skew on nodes, we formulate the joint clustering and data sharing problem based on the privacy-preserving constrained graph. To tackle the serious coupling of decisions on the graph, we devise a distribution-based adaptive clustering algorithm (DACA) basing on three deductive cluster-forming conditions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data