A Penalty-Based Method for Communication-Efficient Decentralized Bilevel Programming
Parvin Nazari, Ahmad Mousavi, Davoud Ataee Tarzanagh, and George, Michailidis

TL;DR
This paper presents a communication-efficient decentralized algorithm for bilevel programming that leverages penalty functions and hyper-gradient estimation, achieving theoretical convergence guarantees and practical effectiveness.
Contribution
It introduces a novel penalty-based decentralized bilevel optimization algorithm with theoretical convergence analysis and improved iteration complexity.
Findings
Achieves non-asymptotic convergence to the optimal solution.
Reduces communication costs through hyper-gradient estimation.
Performs well in real-world experiments.
Abstract
Bilevel programming has recently received attention in the literature due to its wide range of applications, including reinforcement learning and hyper-parameter optimization. However, it is widely assumed that the underlying bilevel optimization problem is solved either by a single machine or, in the case of multiple machines connected in a star-shaped network, i.e., in a federated learning setting. The latter approach suffers from a high communication cost on the central node (e.g., parameter server). Hence, there is an interest in developing methods that solve bilevel optimization problems in a communication-efficient, decentralized manner. To that end, this paper introduces a penalty function-based decentralized algorithm with theoretical guarantees for this class of optimization problems. Specifically, a distributed alternating gradient-type algorithm for solving consensus bilevel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpinal Dysraphism and Malformations
