On the Condition Number Dependency in Bilevel Optimization
Lesi Chen, Jingzhao Zhang

TL;DR
This paper investigates the complexity bounds of bilevel optimization with nonconvex upper-level and strongly convex lower-level problems, establishing new lower and upper bounds that reveal a gap from minimax problems and extend to various settings.
Contribution
It introduces the first provable gap between bilevel and minimax problems in condition number dependency and extends bounds to high-order, stochastic, and convex hyper-objective scenarios.
Findings
Established a $ ilde{ ext{O}}( ext{condition number}^{7/2} imes ext{epsilon}^{-2})$ upper bound.
Proved a $ ext{Omega}( ext{condition number}^2 imes ext{epsilon}^{-2})$ lower bound.
Extended bounds to high-order smooth, stochastic, and convex hyper-objective problems.
Abstract
Bilevel optimization minimizes an objective function, defined by an upper-level problem whose feasible region is the solution of a lower-level problem. We study the oracle complexity of finding an -stationary point with first-order methods when the upper-level problem is nonconvex and the lower-level problem is strongly convex. Recent works (Ji et al., ICML 2021; Arbel and Mairal, ICLR 2022; Chen el al., JMLR 2025) achieve a upper bound that is near-optimal in . However, the optimal dependency on the condition number is unknown. In this work, we establish a new lower bound and upper bound for this problem, establishing the first provable gap between bilevel problems and minimax problems in this setup. Our lower bounds can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Risk and Portfolio Optimization · Optimization and Variational Analysis
