First-Order Methods for Linearly Constrained Bilevel Optimization
Guy Kornowski, Swati Padmanabhan, Kai Wang, Zhe Zhang, and Suvrit Sra

TL;DR
This paper introduces first-order algorithms for linearly constrained bilevel optimization that achieve near-optimal convergence rates, addressing the computational challenges of Hessian calculations in high-dimensional problems.
Contribution
The authors develop new first-order methods with finite-time guarantees for constrained bilevel problems, including nearly-optimal and dimension-free convergence rates.
Findings
Achieve $ ilde{O}(rac{1}{\e^2})$ gradient calls for linear equality constraints.
Attain $(\delta, ext{ extepsilon})$-Goldstein stationarity in $ ilde{O}(d ext{ extdelta}^{-1} ext{ extepsilon}^{-3})$ calls for linear inequality constraints.
Dimension-free rates of $ ilde{O}( ext{ extdelta}^{-1} ext{ extepsilon}^{-4})$ oracle complexity under additional dual variable access.
Abstract
Algorithms for bilevel optimization often encounter Hessian computations, which are prohibitive in high dimensions. While recent works offer first-order methods for unconstrained bilevel problems, the constrained setting remains relatively underexplored. We present first-order linearly constrained optimization methods with finite-time hypergradient stationarity guarantees. For linear equality constraints, we attain -stationarity in gradient oracle calls, which is nearly-optimal. For linear inequality constraints, we attain -Goldstein stationarity in gradient oracle calls, where is the upper-level dimension. Finally, we obtain for the linear inequality setting dimension-free rates of oracle complexity under the additional assumption of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Stochastic processes and financial applications · Optimization and Variational Analysis
