Adaptive Algorithms for Nonconvex Bilevel Optimization under P{\L} Conditions
Xu Shi, Yinglin Du, Rufeng Xiao, Rujun Jiang

TL;DR
This paper introduces fully adaptive algorithms for nonconvex bilevel optimization under Polyak-Łojasiewicz conditions, removing the need for problem-specific parameters and achieving optimal iteration and oracle complexities.
Contribution
The paper presents the first fully adaptive methods for nonconvex bilevel optimization under P{ extL} conditions, eliminating the need for prior parameter knowledge.
Findings
Achieve $ ilde{O}(1/ ext{epsilon}^2)$ iteration complexity.
Attain near-optimal first-order oracle complexity.
Match the complexity of gradient descent for single-level problems.
Abstract
Existing methods for nonconvex bilevel optimization (NBO) require prior knowledge of first- and second-order problem-specific parameters (e.g., Lipschitz constants and the Polyak-{\L}ojasiewicz (P{\L}) parameters) to set step sizes, a requirement that poses practical limitations when such parameters are unknown or computationally expensive. We introduce the Adaptive Fully First-order Bilevel Approximation (AFBA) algorithm and its accelerated variant, AFBA, for solving NBO problems under the P{\L} conditions. To our knowledge, these are the first methods to employ fully adaptive step size strategies, eliminating the need for any problem-specific parameters in NBO. We prove that both algorithms achieve iteration complexity for finding an -stationary point, matching the iteration complexity of existing well-tuned methods. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Optimization and Variational Analysis · Sparse and Compressive Sensing Techniques
