FedCluster: Boosting the Convergence of Federated Learning via Cluster-Cycling
Cheng Chen, Ziyi Chen, Yi Zhou, Bhavya Kailkhura

TL;DR
FedCluster introduces a cyclic clustering approach in federated learning, significantly enhancing convergence speed by organizing devices into clusters that perform cyclic meta-updates, especially effective under data heterogeneity.
Contribution
This paper proposes FedCluster, a novel federated learning framework that improves convergence efficiency through device clustering and cyclic updates, with theoretical and empirical validation.
Findings
FedCluster converges faster than FedAvg in nonconvex optimization.
The framework is effective across diverse data heterogeneity levels.
Experimental results on deep learning tasks confirm improved convergence speed.
Abstract
We develop FedCluster--a novel federated learning framework with improved optimization efficiency, and investigate its theoretical convergence properties. The FedCluster groups the devices into multiple clusters that perform federated learning cyclically in each learning round. Therefore, each learning round of FedCluster consists of multiple cycles of meta-update that boost the overall convergence. In nonconvex optimization, we show that FedCluster with the devices implementing the local {stochastic gradient descent (SGD)} algorithm achieves a faster convergence rate than the conventional {federated averaging (FedAvg)} algorithm in the presence of device-level data heterogeneity. We conduct experiments on deep learning applications and demonstrate that FedCluster converges significantly faster than the conventional federated learning under diverse levels of device-level data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Age of Information Optimization
