Adaptive Algorithms for Nonconvex Bilevel Optimization under P{\L} Conditions

Xu Shi; Yinglin Du; Rufeng Xiao; Rujun Jiang

arXiv:2512.24291·math.OC·January 1, 2026

Adaptive Algorithms for Nonconvex Bilevel Optimization under P{\L} Conditions

Xu Shi, Yinglin Du, Rufeng Xiao, Rujun Jiang

PDF

Open Access

TL;DR

This paper introduces fully adaptive algorithms for nonconvex bilevel optimization under Polyak-Łojasiewicz conditions, removing the need for problem-specific parameters and achieving optimal iteration and oracle complexities.

Contribution

The paper presents the first fully adaptive methods for nonconvex bilevel optimization under P{ extL} conditions, eliminating the need for prior parameter knowledge.

Findings

01

Achieve $ ilde{O}(1/ ext{epsilon}^2)$ iteration complexity.

02

Attain near-optimal first-order oracle complexity.

03

Match the complexity of gradient descent for single-level problems.

Abstract

Existing methods for nonconvex bilevel optimization (NBO) require prior knowledge of first- and second-order problem-specific parameters (e.g., Lipschitz constants and the Polyak-{\L}ojasiewicz (P{\L}) parameters) to set step sizes, a requirement that poses practical limitations when such parameters are unknown or computationally expensive. We introduce the Adaptive Fully First-order Bilevel Approximation (AF $^{2}$ BA) algorithm and its accelerated variant, A $^{2}$ F $^{2}$ BA, for solving NBO problems under the P{\L} conditions. To our knowledge, these are the first methods to employ fully adaptive step size strategies, eliminating the need for any problem-specific parameters in NBO. We prove that both algorithms achieve $O (1/ ϵ^{2})$ iteration complexity for finding an $ϵ$ -stationary point, matching the iteration complexity of existing well-tuned methods. Furthermore,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Optimization and Variational Analysis · Sparse and Compressive Sensing Techniques