A General Theory for Client Sampling in Federated Learning
Yann Fraboni, Richard Vidal, Laetitia Kameni, Marco Lorenzi

TL;DR
This paper develops a comprehensive theoretical framework to analyze how client sampling schemes and client heterogeneity affect convergence in federated learning, providing insights into optimal sampling strategies.
Contribution
It introduces a unified theory linking sampling schemes, heterogeneity, and convergence, and compares Multinomial and Uniform sampling in various scenarios.
Findings
Multinomial sampling is more resilient to data ratio changes.
Uniform sampling performs best when clients have equal data amounts.
The covariance of aggregation weights significantly impacts convergence.
Abstract
While client sampling is a central operation of current state-of-the-art federated learning (FL) approaches, the impact of this procedure on the convergence and speed of FL remains under-investigated. In this work, we provide a general theoretical framework to quantify the impact of a client sampling scheme and of the clients heterogeneity on the federated optimization. First, we provide a unified theoretical ground for previously reported sampling schemes experimental results on the relationship between FL convergence and the variance of the aggregation weights. Second, we prove for the first time that the quality of FL convergence is also impacted by the resulting covariance between aggregation weights. Our theory is general, and is here applied to Multinomial Distribution (MD) and Uniform sampling, two default unbiased client sampling schemes of FL, and demonstrated through a series…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Mobile Crowdsensing and Crowdsourcing
