Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often!
Milad Sefidgaran, Romain Chor, Abdellatif Zaidi, Yijun Wan

TL;DR
This paper analyzes how the frequency of communication rounds in federated learning affects the generalization error, revealing that more frequent communication can reduce generalization performance, with theoretical bounds and empirical evidence supporting this.
Contribution
The paper introduces the first PAC-Bayes and rate-distortion bounds on generalization error in federated learning, explicitly incorporating communication frequency effects.
Findings
Generalization error increases with more communication rounds for FSVM.
Faster decrease of population risk compared to empirical risk with increasing R.
Experimental results suggest population risk may increase beyond certain communication frequency.
Abstract
We investigate the generalization error of statistical learning models in a Federated Learning (FL) setting. Specifically, we study the evolution of the generalization error with the number of communication rounds between clients and a parameter server (PS), i.e., the effect on the generalization error of how often the clients' local models are aggregated at PS. In our setup, the more the clients communicate with PS the less data they use for local training in each round, such that the amount of training data per client is identical for distinct values of . We establish PAC-Bayes and rate-distortion theoretic bounds on the generalization error that account explicitly for the effect of the number of rounds , in addition to the number of participating devices and individual datasets size . The bounds, which apply to a large class of loss functions and learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Wireless Communication Security Techniques
