Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems
Junyi Li, Feihu Huang, Heng Huang

TL;DR
This paper introduces FedBiOAcc, a communication-efficient algorithm for federated bilevel optimization that achieves linear speedup and demonstrates superior empirical performance in real-world tasks.
Contribution
It proposes FedBiOAcc, the first communication-efficient federated bilevel optimization algorithm with accelerated hyper-gradient estimation and convergence guarantees.
Findings
Achieves $O(rac{1}{ ext{epsilon}})$ communication complexity.
Demonstrates linear speedup with the number of clients.
Outperforms existing methods in federated data-cleaning and hyper-representation learning.
Abstract
Bilevel Optimization has witnessed notable progress recently with new emerging efficient algorithms. However, its application in the Federated Learning setting remains relatively underexplored, and the impact of Federated Learning's inherent challenges on the convergence of bilevel algorithms remain obscure. In this work, we investigate Federated Bilevel Optimization problems and propose a communication-efficient algorithm, named FedBiOAcc. The algorithm leverages an efficient estimation of the hyper-gradient in the distributed setting and utilizes the momentum-based variance-reduction acceleration. Remarkably, FedBiOAcc achieves a communication complexity , a sample complexity and the linear speed up with respect to the number of clients. We also analyze a special case of the Federated Bilevel Optimization problems, where lower level problems are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Bone and Joint Diseases
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
