Single-loop Projection-free and Projected Gradient-based Algorithms for   Nonconvex-concave Saddle Point Problems with Bilevel Structure

Mohammad Mahdi Ahmadi; Erfan Yazdandoost Hamedani

arXiv:2404.13021·math.OC·March 31, 2025·J. Sci. Comput.

Single-loop Projection-free and Projected Gradient-based Algorithms for Nonconvex-concave Saddle Point Problems with Bilevel Structure

Mohammad Mahdi Ahmadi, Erfan Yazdandoost Hamedani

PDF

Open Access

TL;DR

This paper introduces novel single-loop projection-free and projected gradient algorithms for nonconvex-concave saddle point problems with bilevel structure, applicable to machine learning tasks like robust multi-task learning, with improved iteration complexities.

Contribution

It proposes the first single-loop, projection-free and projected gradient algorithms for bilevel saddle point problems, extending applicability beyond strongly concave cases.

Findings

01

The projection-free algorithm requires $ ilde{O}(rac{1}{ ext{epsilon}^4})$ iterations for an $ ext{epsilon}$-stationary point.

02

The gradient-based algorithm achieves $ ilde{O}(rac{1}{ ext{epsilon}^5})$ iterations, improved to $ ilde{O}(rac{1}{ ext{epsilon}^4})$ under strong concavity.

03

Experiments demonstrate the superiority of the proposed methods over state-of-the-art algorithms.

Abstract

In this paper, we explore a broad class of constrained saddle point problems with a bilevel structure, wherein the upper-level objective function is nonconvex-concave and smooth over compact and convex constraint sets, subject to a strongly convex lower-level objective function. This class of problems finds wide applicability in machine learning, encompassing robust multi-task learning, adversarial learning, and robust meta-learning. Our study extends the current literature in two main directions: (i) We consider a more general setting where the upper-level function is not necessarily strongly concave or linear in the maximization variable. (ii) While existing methods for solving saddle point problems with a bilevel structure are projection-based algorithms, we propose a one-sided projection-free method employing a linear minimization oracle. Specifically, by utilizing regularization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques