Clustered Federated Learning based on Nonconvex Pairwise Fusion
Xue Yu, Ziyi Liu, Wu Wang, Yifan Sun

TL;DR
This paper introduces a novel clustered federated learning framework that autonomously identifies device clusters using nonconvex pairwise penalties, reducing communication costs and ensuring privacy, with proven convergence and superior experimental performance.
Contribution
It proposes FPFC, a new federated clustering method based on nonconvex penalties, with a parallelizable ADMM-based implementation and theoretical convergence guarantees.
Findings
FPFC outperforms existing methods in experiments.
The framework effectively identifies clusters without prior knowledge.
Theoretical analysis confirms convergence and statistical rate.
Abstract
This study investigates clustered federated learning (FL), one of the formulations of FL with non-i.i.d. data, where the devices are partitioned into clusters and each cluster optimally fits its data with a localized model. We propose a clustered FL framework that incorporates a nonconvex penalty to pairwise differences of parameters. Without a priori knowledge of the set of devices in each cluster and the number of clusters, this framework can autonomously estimate cluster structures. To implement the proposed framework, we introduce a novel clustered FL method called Fusion Penalized Federated Clustering (FPFC). Building upon the standard alternating direction method of multipliers (ADMM), FPFC can perform partial updates at each communication round and allows parallel computation with variable workload. These strategies significantly reduce the communication cost while ensuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cooperative Communication and Network Coding · Stochastic Gradient Optimization Techniques
MethodsAlternating Direction Method of Multipliers
