Clustered Data Sharing for Non-IID Federated Learning over Wireless Networks
Gang Hu, Yinglei Teng, Nan Wang, F. Richard Yu

TL;DR
This paper introduces a clustered data sharing framework for federated learning over wireless networks, improving model accuracy and convergence on non-IID data through device-to-device communication and adaptive clustering.
Contribution
It proposes a novel privacy-preserving clustered data sharing method with an adaptive clustering algorithm tailored for non-IID federated learning scenarios.
Findings
Enhanced convergence of FL on non-IID data
Improved model accuracy with limited communication
Effective data sharing via device-to-device communication
Abstract
Federated Learning (FL) is a novel distributed machine learning approach to leverage data from Internet of Things (IoT) devices while maintaining data privacy. However, the current FL algorithms face the challenges of non-independent and identically distributed (non-IID) data, which causes high communication costs and model accuracy declines. To address the statistical imbalances in FL, we propose a clustered data sharing framework which spares the partial data from cluster heads to credible associates through device-to-device (D2D) communication. Moreover, aiming at diluting the data skew on nodes, we formulate the joint clustering and data sharing problem based on the privacy-preserving constrained graph. To tackle the serious coupling of decisions on the graph, we devise a distribution-based adaptive clustering algorithm (DACA) basing on three deductive cluster-forming conditions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
