TL;DR
This paper systematically evaluates two state-of-the-art Clustered Federated Learning algorithms across various data heterogeneity scenarios, providing insights into their performance and the impact of data distribution differences.
Contribution
It introduces a taxonomy for data heterogeneity in federated learning and assesses CFL algorithms' performance across these scenarios using multiple datasets.
Findings
CFL algorithms perform variably depending on heterogeneity type
The proposed taxonomy clarifies heterogeneity scenarios in FL
Performance metrics reveal strengths and limitations of CFL methods
Abstract
Over recent years, Federated Learning (FL) has proven to be one of the most promising methods of distributed learning which preserves data privacy. As the method evolved and was confronted to various real-world scenarios, new challenges have emerged. One such challenge is the presence of highly heterogeneous (often referred as non-IID) data distributions among participants of the FL protocol. A popular solution to this hurdle is Clustered Federated Learning (CFL), which aims to partition clients into groups where the distribution are homogeneous. In the literature, state-of-the-art CFL algorithms are often tested using a few cases of data heterogeneities, without systematically justifying the choices. Further, the taxonomy used for differentiating the different heterogeneity scenarios is not always straightforward. In this paper, we explore the performance of two state-of-theart CFL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
