Asynchronous Distributed Bilevel Optimization

Yang Jiao; Kai Yang; Tiancheng Wu; Dongjin Song; Chengtao Jian

arXiv:2212.10048·cs.LG·February 27, 2023·6 cites

Asynchronous Distributed Bilevel Optimization

Yang Jiao, Kai Yang, Tiancheng Wu, Dongjin Song, Chengtao Jian

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ADBO, an asynchronous distributed algorithm for bilevel optimization that addresses communication and straggler issues, with proven convergence and empirical validation on public datasets.

Contribution

It proposes the first asynchronous distributed bilevel optimization algorithm with convergence guarantees and practical efficiency improvements.

Findings

01

Convergence to an $ ext{epsilon}$-stationary point is theoretically guaranteed.

02

Iteration complexity is bounded by $ ext{O}(1/ ext{epsilon}^2)$.

03

Empirical results demonstrate effectiveness and efficiency on public datasets.

Abstract

Bilevel optimization plays an essential role in many machine learning tasks, ranging from hyperparameter optimization to meta-learning. Existing studies on bilevel optimization, however, focus on either centralized or synchronous distributed setting. The centralized bilevel optimization approaches require collecting massive amount of data to a single server, which inevitably incur significant communication expenses and may give rise to data privacy risks. Synchronous distributed bilevel optimization algorithms, on the other hand, often face the straggler problem and will immediately stop working if a few workers fail to respond. As a remedy, we propose Asynchronous Distributed Bilevel Optimization (ADBO) algorithm. The proposed ADBO can tackle bilevel optimization problems with both nonconvex upper-level and lower-level objective functions, and its convergence is theoretically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iclr23submission6251/adbo
pytorchOfficial

Videos

Asynchronous Distributed Bilevel Optimization· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Pancreatic and Hepatic Oncology Research · Advanced Bandit Algorithms Research

Methodsfail