Federated learning with incremental clustering for heterogeneous data
Fabiola Espinoza Castellon, Aurelien Mayoue, Jacques-Henri, Sublemontier, Cedric Gouy-Pailler

TL;DR
This paper introduces FLIC, a federated learning method that incrementally clusters clients based on their updates, improving performance on heterogeneous data without extra communication, and demonstrating robustness against poisoning attacks.
Contribution
FLIC enables client clustering during federated training using update information, avoiding the need for simultaneous parameter transmission and enhancing robustness against malicious clients.
Findings
Successfully clusters clients with similar data distributions in non-IID scenarios.
Maintains performance without additional communication overhead.
Provides robustness against poisoning attacks even with over 50% malicious clients.
Abstract
Federated learning enables different parties to collaboratively build a global model under the orchestration of a server while keeping the training data on clients' devices. However, performance is affected when clients have heterogeneous data. To cope with this problem, we assume that despite data heterogeneity, there are groups of clients who have similar data distributions that can be clustered. In previous approaches, in order to cluster clients the server requires clients to send their parameters simultaneously. However, this can be problematic in a context where there is a significant number of participants that may have limited availability. To prevent such a bottleneck, we propose FLIC (Federated Learning with Incremental Clustering), in which the server exploits the updates sent by clients during federated training instead of asking them to send their parameters simultaneously.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Graph Neural Networks · HIV, Drug Use, Sexual Risk
