Efficient Data Distribution Estimation for Accelerated Federated   Learning

Yuanli Wang; Lei Huang

arXiv:2406.01774·cs.DC·June 5, 2024

Efficient Data Distribution Estimation for Accelerated Federated Learning

Yuanli Wang, Lei Huang

PDF

Open Access

TL;DR

This paper introduces an efficient data distribution estimation method to significantly reduce overhead in large-scale federated learning, improving device selection efficiency and model training speed.

Contribution

It proposes a novel data distribution summary calculation algorithm that drastically cuts down the time required for client selection in federated learning.

Findings

01

Achieves up to 30x reduction in data summary time.

02

Achieves up to 360x reduction in clustering time.

03

Enhances scalability of federated learning systems.

Abstract

Federated Learning(FL) is a privacy-preserving machine learning paradigm where a global model is trained in-situ across a large number of distributed edge devices. These systems are often comprised of millions of user devices and only a subset of available devices can be used for training in each epoch. Designing a device selection strategy is challenging, given that devices are highly heterogeneous in both their system resources and training data. This heterogeneity makes device selection very crucial for timely model convergence and sufficient model accuracy. To tackle the FL client heterogeneity problem, various client selection algorithms have been developed, showing promising performance improvement in terms of model coverage and accuracy. In this work, we study the overhead of client selection algorithms in a large scale FL environment. Then we propose an efficient data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks