On the Condition Number Dependency in Bilevel Optimization

Lesi Chen; Jingzhao Zhang

arXiv:2511.22331·math.OC·December 1, 2025

On the Condition Number Dependency in Bilevel Optimization

Lesi Chen, Jingzhao Zhang

PDF

Open Access

TL;DR

This paper investigates the complexity bounds of bilevel optimization with nonconvex upper-level and strongly convex lower-level problems, establishing new lower and upper bounds that reveal a gap from minimax problems and extend to various settings.

Contribution

It introduces the first provable gap between bilevel and minimax problems in condition number dependency and extends bounds to high-order, stochastic, and convex hyper-objective scenarios.

Findings

01

Established a $ ilde{ ext{O}}( ext{condition number}^{7/2} imes ext{epsilon}^{-2})$ upper bound.

02

Proved a $ ext{Omega}( ext{condition number}^2 imes ext{epsilon}^{-2})$ lower bound.

03

Extended bounds to high-order smooth, stochastic, and convex hyper-objective problems.

Abstract

Bilevel optimization minimizes an objective function, defined by an upper-level problem whose feasible region is the solution of a lower-level problem. We study the oracle complexity of finding an $ϵ$ -stationary point with first-order methods when the upper-level problem is nonconvex and the lower-level problem is strongly convex. Recent works (Ji et al., ICML 2021; Arbel and Mairal, ICLR 2022; Chen el al., JMLR 2025) achieve a $\tilde{O} (κ^{4} ϵ^{- 2})$ upper bound that is near-optimal in $ϵ$ . However, the optimal dependency on the condition number $κ$ is unknown. In this work, we establish a new $Ω (κ^{2} ϵ^{- 2})$ lower bound and $\tilde{O} (κ^{7/2} ϵ^{- 2})$ upper bound for this problem, establishing the first provable gap between bilevel problems and minimax problems in this setup. Our lower bounds can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Risk and Portfolio Optimization · Optimization and Variational Analysis