Order Optimal Bounds for One-Shot Federated Learning over non-Convex   Loss Functions

Arsalan Sharifnassab; Saber Salehkaleybar; S. Jamaloddin Golestani

arXiv:2108.08677·cs.LG·February 7, 2024·1 cites

Order Optimal Bounds for One-Shot Federated Learning over non-Convex Loss Functions

Arsalan Sharifnassab, Saber Salehkaleybar, S. Jamaloddin Golestani

PDF

Open Access

TL;DR

This paper establishes fundamental limits and proposes an optimal algorithm for one-shot federated learning with non-convex loss functions, balancing communication constraints and sample size to minimize expected loss.

Contribution

It derives the first order-optimal bounds for one-shot federated learning with non-convex losses and introduces the MRE-NC algorithm matching these bounds.

Findings

01

Lower bound on expected loss: (rac{1}{\u221a{n}(mB)^{1/d}}, rac{1}{{mn}})

02

Proposed MRE-NC algorithm achieves near-optimal expected loss in large {mn} regime

03

Results highlight the trade-off between communication budget and sample size in federated learning

Abstract

We consider the problem of federated learning in a one-shot setting in which there are $m$ machines, each observing $n$ sample functions from an unknown distribution on non-convex loss functions. Let $F : [- 1, 1]^{d} \to R$ be the expected loss function with respect to this unknown distribution. The goal is to find an estimate of the minimizer of $F$ . Based on its observations, each machine generates a signal of bounded length $B$ and sends it to a server. The server collects signals of all machines and outputs an estimate of the minimizer of $F$ . We show that the expected loss of any algorithm is lower bounded by $max (1/ (n (m B)^{1/ d}), 1/ mn)$ , up to a logarithmic factor. We then prove that this lower bound is order optimal in $m$ and $n$ by presenting a distributed learning algorithm, called Multi-Resolution Estimator for Non-Convex loss function (MRE-NC), whose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms · Domain Adaptation and Few-Shot Learning