On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level
Feihu Huang

TL;DR
This paper introduces momentum-based gradient methods for nonconvex bilevel optimization problems with lower-level PL condition, achieving improved convergence rates and demonstrating effectiveness through experiments.
Contribution
It proposes novel momentum-based algorithms for nonconvex bilevel problems with theoretical convergence guarantees and superior complexity bounds.
Findings
MGBiO achieves $O(rac{1}{\epsilon^2})$ complexity for deterministic problems.
MSGBiO and VR-MSGBiO improve stochastic problem complexities to $ ilde{O}(rac{1}{\epsilon^4})$ and $ ilde{O}(rac{1}{\epsilon^3})$.
Experimental results validate the efficiency of the proposed methods.
Abstract
Bilevel optimization is a popular two-level hierarchical optimization, which has been widely applied to many machine learning tasks such as hyperparameter learning, meta learning and continual learning. Although many bilevel optimization methods recently have been developed, the bilevel methods are not well studied when the lower-level problem is nonconvex. To fill this gap, in the paper, we study a class of nonconvex bilevel optimization problems, where both upper-level and lower-level problems are nonconvex, and the lower-level problem satisfies Polyak-{\L}ojasiewicz (PL) condition. We propose an efficient momentum-based gradient bilevel method (MGBiO) to solve these deterministic problems. Meanwhile, we propose a class of efficient momentum-based stochastic gradient bilevel methods (MSGBiO and VR-MSGBiO) to solve these stochastic problems. Moreover, we provide a useful convergence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Extracellular vesicles in disease
