A survey on Clustered Federated Learning: Taxonomy, Analysis and Applications
Michael Ben Ali (IRIT), Omar El-Rifai (CIS-ENSMSE), Imen Megdiche (IRIT-SIG, INUC), Andr\'e Peninou (IRIT-SIG, UT2J), Olivier Teste (IRIT-SIG)

TL;DR
This survey systematically reviews Clustered Federated Learning, classifies algorithms into a taxonomy, and highlights the gap between theoretical privacy-focused methods and practical efficiency-driven applications.
Contribution
It introduces a clear taxonomy for CFL algorithms, distinguishes core data heterogeneity from operational variants, and analyzes the current research and application landscape.
Findings
Theoretical research emphasizes privacy-preserving methods.
Real-world applications favor efficiency and metadata-based approaches.
A clear distinction exists between Core CFL and operational variants.
Abstract
As Federated Learning (FL) expands, the challenge of non-independent and identically distributed (non-IID) data becomes critical. Clustered Federated Learning (CFL) addresses this by training multiple specialized models, each representing a group of clients with similar data distributions. However, the term ''CFL'' has increasingly been applied to operational strategies unrelated to data heterogeneity, creating significant ambiguity. This survey provides a systematic review of the CFL literature and introduces a principled taxonomy that classifies algorithms into Server-side, Client-side, and Metadata-based approaches. Our analysis reveals a distinct dichotomy: while theoretical research prioritizes privacy-preserving Server/Client-side methods, real-world applications in IoT, Mobility, and Energy overwhelmingly favor Metadata-based efficiency. Furthermore, we explicitly distinguish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
