On the Communication Complexity of Decentralized Bilevel Optimization
Yihan Zhang, My T. Thai, Jie Wu, Hongchang Gao

TL;DR
This paper introduces two new decentralized stochastic bilevel optimization algorithms that achieve faster convergence and lower communication costs in heterogeneous settings, with theoretical guarantees and empirical validation.
Contribution
The paper presents the first decentralized bilevel algorithms with convergence guarantees under mild heterogeneity assumptions, improving efficiency over existing methods.
Findings
Faster convergence rates than existing methods.
Lower communication costs in heterogeneous environments.
Empirical results confirm algorithm efficacy.
Abstract
Stochastic bilevel optimization finds widespread applications in machine learning, including meta-learning, hyperparameter optimization, and neural architecture search. To extend stochastic bilevel optimization to distributed data, several decentralized stochastic bilevel optimization algorithms have been developed. However, existing methods often suffer from slow convergence rates and high communication costs in heterogeneous settings, limiting their applicability to real-world tasks. To address these issues, we propose two novel decentralized stochastic bilevel gradient descent algorithms based on simultaneous and alternating update strategies. Our algorithms can achieve faster convergence rates and lower communication costs than existing methods. Importantly, our convergence analyses do not rely on strong assumptions regarding heterogeneity. More importantly, our theoretical analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Optimization and Variational Analysis · Economic theories and models
