Achieving Linear Speedup in Non-IID Federated Bilevel Learning
Minhui Huang, Dewei Zhang, Kaiyi Ji

TL;DR
This paper introduces FedMBO, a novel federated bilevel optimization algorithm that achieves linear speedup in convergence rate for non-i.i.d. datasets, addressing key open problems in federated learning.
Contribution
We propose FedMBO with a new client sampling scheme, providing the first theoretical linear speedup results for non-i.i.d. federated bilevel optimization.
Findings
FedMBO achieves a convergence rate of O(1/√(nK) + 1/K + √n/K^{3/2}) on non-i.i.d. datasets.
FedMBO demonstrates linear speedup with respect to the number of clients in theory.
Experimental results validate the theoretical convergence rate and effectiveness of FedMBO.
Abstract
Federated bilevel optimization has received increasing attention in various emerging machine learning and communication applications. Recently, several Hessian-vector-based algorithms have been proposed to solve the federated bilevel optimization problem. However, several important properties in federated learning such as the partial client participation and the linear speedup for convergence (i.e., the convergence rate and complexity are improved linearly with respect to the number of sampled clients) in the presence of non-i.i.d.~datasets, still remain open. In this paper, we fill these gaps by proposing a new federated bilevel algorithm named FedMBO with a novel client sampling scheme in the federated hypergradient estimation. We show that FedMBO achieves a convergence rate of on non-i.i.d.~datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Gallbladder and Bile Duct Disorders · Cooperative Communication and Network Coding
