A Penalty-Based Method for Communication-Efficient Decentralized Bilevel   Programming

Parvin Nazari; Ahmad Mousavi; Davoud Ataee Tarzanagh; and George; Michailidis

arXiv:2211.04088·cs.LG·October 11, 2024·1 cites

A Penalty-Based Method for Communication-Efficient Decentralized Bilevel Programming

Parvin Nazari, Ahmad Mousavi, Davoud Ataee Tarzanagh, and George, Michailidis

PDF

Open Access

TL;DR

This paper presents a communication-efficient decentralized algorithm for bilevel programming that leverages penalty functions and hyper-gradient estimation, achieving theoretical convergence guarantees and practical effectiveness.

Contribution

It introduces a novel penalty-based decentralized bilevel optimization algorithm with theoretical convergence analysis and improved iteration complexity.

Findings

01

Achieves non-asymptotic convergence to the optimal solution.

02

Reduces communication costs through hyper-gradient estimation.

03

Performs well in real-world experiments.

Abstract

Bilevel programming has recently received attention in the literature due to its wide range of applications, including reinforcement learning and hyper-parameter optimization. However, it is widely assumed that the underlying bilevel optimization problem is solved either by a single machine or, in the case of multiple machines connected in a star-shaped network, i.e., in a federated learning setting. The latter approach suffers from a high communication cost on the central node (e.g., parameter server). Hence, there is an interest in developing methods that solve bilevel optimization problems in a communication-efficient, decentralized manner. To that end, this paper introduces a penalty function-based decentralized algorithm with theoretical guarantees for this class of optimization problems. Specifically, a distributed alternating gradient-type algorithm for solving consensus bilevel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpinal Dysraphism and Malformations