Clustered Federated Learning: Model-Agnostic Distributed Multi-Task Optimization under Privacy Constraints

Felix Sattler; Klaus-Robert Müller; Wojciech Samek

arXiv:1910.01991·cs.LG·October 7, 2019

TL;DR

This paper introduces Clustered Federated Learning (CFL), a novel framework that groups clients based on data distribution similarities to improve model performance in federated settings, especially with non-i.i.d. data.

Contribution

CFL is a model-agnostic, privacy-preserving clustering method that enhances federated learning by exploiting loss surface geometry without altering communication protocols.

Findings

01. CFL improves model accuracy over standard FL in non-i.i.d. scenarios.

02. Theoretical guarantees on clustering quality are provided.

03. Experimental results on neural networks validate CFL's effectiveness.

Abstract

Federated Learning (FL) is currently the most widely adopted framework for collaborative training of (deep) machine learning models under privacy constraints. Despite its popularity, Federated Learning has been observed to yield suboptimal results when the local clients' data distributions diverge. To address this issue, we present Clustered Federated Learning (CFL), a novel Federated Multi-Task Learning (FMTL) framework that exploits geometric properties of the FL loss surface to group the client population into clusters with jointly trainable data distributions. In contrast to existing FMTL approaches, CFL requires no modifications to the FL communication protocol, is applicable to general non-convex objectives (in particular deep neural networks), and comes with strong mathematical guarantees on the clustering quality. CFL is flexible enough to handle client…
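
The grouping step described in the abstract can be pictured with a small sketch. It assumes clients are compared by the cosine similarity of their flattened weight updates (the measure the full paper builds on) and then split in two with a plain complete-linkage merge. The function names and the toy vectors are hypothetical, and this is an illustration under those assumptions, not the authors' implementation.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine of the angle between two flattened update vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_matrix(updates):
    """Pairwise cosine similarities between all client updates."""
    n = len(updates)
    return [[cosine_similarity(updates[i], updates[j]) for j in range(n)]
            for i in range(n)]


def bipartition(sim):
    """Split clients into two groups by complete-linkage merging.

    Start with one cluster per client and repeatedly merge the two clusters
    whose least similar pair of members is most similar, until two remain.
    """
    clusters = [[i] for i in range(len(sim))]
    while len(clusters) > 2:
        best_pair, best_score = None, -math.inf
        for a, b in combinations(range(len(clusters)), 2):
            # Complete linkage: score a merge by its worst cross pair.
            score = min(sim[i][j] for i in clusters[a] for j in clusters[b])
            if score > best_score:
                best_pair, best_score = (a, b), score
        a, b = best_pair
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return [sorted(c) for c in clusters]


# Hypothetical toy updates: clients 0-1 and clients 2-3 pull in
# roughly opposite directions, as if trained on incongruent data.
toy_updates = [
    [1.0, 0.1],
    [0.9, -0.1],
    [-1.0, 0.2],
    [-0.8, -0.1],
]
groups = sorted(bipartition(similarity_matrix(toy_updates)))
print(groups)  # [[0, 1], [2, 3]]
```

Only the updates that clients already send to the server are needed here, which is consistent with the claim that the FL communication protocol stays unchanged.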
