Order Optimal Bounds for One-Shot Federated Learning over non-Convex Loss Functions
Arsalan Sharifnassab, Saber Salehkaleybar, S. Jamaloddin Golestani

TL;DR
This paper establishes fundamental limits and proposes an optimal algorithm for one-shot federated learning with non-convex loss functions, balancing communication constraints and sample size to minimize expected loss.
Contribution
It derives the first order-optimal bounds for one-shot federated learning with non-convex losses and introduces the MRE-NC algorithm matching these bounds.
Findings
Lower bound on expected loss: (rac{1}{\u221a{n}(mB)^{1/d}}, rac{1}{{mn}})
Proposed MRE-NC algorithm achieves near-optimal expected loss in large {mn} regime
Results highlight the trade-off between communication budget and sample size in federated learning
Abstract
We consider the problem of federated learning in a one-shot setting in which there are machines, each observing sample functions from an unknown distribution on non-convex loss functions. Let be the expected loss function with respect to this unknown distribution. The goal is to find an estimate of the minimizer of . Based on its observations, each machine generates a signal of bounded length and sends it to a server. The server collects signals of all machines and outputs an estimate of the minimizer of . We show that the expected loss of any algorithm is lower bounded by , up to a logarithmic factor. We then prove that this lower bound is order optimal in and by presenting a distributed learning algorithm, called Multi-Resolution Estimator for Non-Convex loss function (MRE-NC), whose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms · Domain Adaptation and Few-Shot Learning
