Single-loop Projection-free and Projected Gradient-based Algorithms for Nonconvex-concave Saddle Point Problems with Bilevel Structure
Mohammad Mahdi Ahmadi, Erfan Yazdandoost Hamedani

TL;DR
This paper introduces novel single-loop projection-free and projected gradient algorithms for nonconvex-concave saddle point problems with bilevel structure, applicable to machine learning tasks like robust multi-task learning, with improved iteration complexities.
Contribution
It proposes the first single-loop, projection-free and projected gradient algorithms for bilevel saddle point problems, extending applicability beyond strongly concave cases.
Findings
The projection-free algorithm requires $ ilde{O}(rac{1}{ ext{epsilon}^4})$ iterations for an $ ext{epsilon}$-stationary point.
The gradient-based algorithm achieves $ ilde{O}(rac{1}{ ext{epsilon}^5})$ iterations, improved to $ ilde{O}(rac{1}{ ext{epsilon}^4})$ under strong concavity.
Experiments demonstrate the superiority of the proposed methods over state-of-the-art algorithms.
Abstract
In this paper, we explore a broad class of constrained saddle point problems with a bilevel structure, wherein the upper-level objective function is nonconvex-concave and smooth over compact and convex constraint sets, subject to a strongly convex lower-level objective function. This class of problems finds wide applicability in machine learning, encompassing robust multi-task learning, adversarial learning, and robust meta-learning. Our study extends the current literature in two main directions: (i) We consider a more general setting where the upper-level function is not necessarily strongly concave or linear in the maximization variable. (ii) While existing methods for solving saddle point problems with a bilevel structure are projection-based algorithms, we propose a one-sided projection-free method employing a linear minimization oracle. Specifically, by utilizing regularization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques
