Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Feihu Huang

TL;DR
This paper introduces an efficient Hessian/Jacobian-free algorithm for nonconvex-PL bilevel optimization problems, achieving optimal convergence rates and reducing computational complexity compared to existing methods.
Contribution
The paper proposes HJFBiO, a novel Hessian/Jacobian-free method with optimal convergence complexity for nonconvex-PL bilevel problems, improving efficiency and theoretical guarantees.
Findings
Achieves an $O(1/T)$ convergence rate.
Has an $O(rac{1}{ ext{epsilon}})$ gradient complexity.
Demonstrates efficiency in numerical experiments.
Abstract
Bilevel optimization is widely applied in many machine learning tasks such as hyper-parameter learning, meta learning and reinforcement learning. Although many algorithms recently have been developed to solve the bilevel optimization problems, they generally rely on the (strongly) convex lower-level problems. More recently, some methods have been proposed to solve the nonconvex-PL bilevel optimization problems, where their upper-level problems are possibly nonconvex, and their lower-level problems are also possibly nonconvex while satisfying Polyak-{\L}ojasiewicz (PL) condition. However, these methods still have a high convergence complexity or a high computation complexity such as requiring compute expensive Hessian/Jacobian matrices and its inverses. In the paper, thus, we propose an efficient Hessian/Jacobian-free method (i.e., HJFBiO) with the optimal convergence complexity to solve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques
