On the Convergence of Federated Averaging under Partial Participation   for Over-parameterized Neural Networks

Xin Liu; Wei li; Dazhi Zhan; Yu Pan; Xin Ma; Yu Ding; Zhisong Pan

arXiv:2310.05495·cs.LG·October 30, 2024

On the Convergence of Federated Averaging under Partial Participation for Over-parameterized Neural Networks

Xin Liu, Wei li, Dazhi Zhan, Yu Pan, Xin Ma, Yu Ding, Zhisong Pan

PDF

Open Access

TL;DR

This paper provides theoretical convergence guarantees for federated averaging in over-parameterized neural networks under partial client participation, supported by experimental validation.

Contribution

It establishes the first convergence analysis of FedAvg with partial participation for over-parameterized neural networks, including deep linear and two-layer ReLU models.

Findings

01

FedAvg converges linearly under partial participation in over-parameterized settings.

02

Convergence rate depends on the minimum number of participating clients per iteration.

03

Experimental results validate the theoretical convergence guarantees.

Abstract

Federated learning (FL) is a widely employed distributed paradigm for collaboratively training machine learning models from multiple clients without sharing local data. In practice, FL encounters challenges in dealing with partial client participation due to the limited bandwidth, intermittent connection and strict synchronized delay. Simultaneously, there exist few theoretical convergence guarantees in this practical setting, especially when associated with the non-convex optimization of neural networks. To bridge this gap, we focus on the training problem of federated averaging (FedAvg) method for two canonical models: a deep linear network and a two-layer ReLU network. Under the over-parameterized assumption, we provably show that FedAvg converges to a global minimum at a linear rate $O ((1 - \frac{mi n _{i \in [t]} ∣ S _{i} ∣}{N ^{2}})^{t})$ after $t$ iterations, where $N$ is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus · Neural Tangent Kernel