On the Convergence of Distributed Stochastic Bilevel Optimization   Algorithms over a Network

Hongchang Gao; Bin Gu; My T. Thai

arXiv:2206.15025·cs.LG·March 28, 2023·1 cites

On the Convergence of Distributed Stochastic Bilevel Optimization Algorithms over a Network

Hongchang Gao, Bin Gu, My T. Thai

PDF

Open Access

TL;DR

This paper introduces two new decentralized stochastic bilevel optimization algorithms for networked data, providing the first convergence rate analysis for nonconvex-strongly-convex problems and demonstrating their effectiveness in machine learning tasks.

Contribution

The paper develops the first decentralized stochastic bilevel algorithms with convergence guarantees for nonconvex-strongly-convex problems, expanding bilevel optimization to distributed settings.

Findings

01

Established convergence rates for the proposed algorithms.

02

Validated effectiveness through experiments on machine learning models.

Abstract

Bilevel optimization has been applied to a wide variety of machine learning models, and numerous stochastic bilevel optimization algorithms have been developed in recent years. However, most existing algorithms restrict their focus on the single-machine setting so that they are incapable of handling the distributed data. To address this issue, under the setting where all participants compose a network and perform peer-to-peer communication in this network, we developed two novel decentralized stochastic bilevel optimization algorithms based on the gradient tracking communication mechanism and two different gradient estimators. Additionally, we established their convergence rates for nonconvex-strongly-convex problems with novel theoretical analysis strategies. To our knowledge, this is the first work achieving these theoretical results. Finally, we applied our algorithms to practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM