First-order penalty methods for bilevel optimization
Zhaosong Lu, Sanyou Mei

TL;DR
This paper introduces first-order penalty methods for bilevel optimization problems with nonsmooth lower levels and nonconvex upper levels, providing complexity guarantees and demonstrating the approach as a novel minimax reformulation.
Contribution
It is the first to show bilevel problems can be approximately solved via minimax reformulation and offers the first implementable method with complexity bounds.
Findings
Proposed penalty methods achieve $O( ext{operation complexity})$ for $oldsymbol{ ext{ε-KKT}}$ solutions.
Numerical results demonstrate the effectiveness of the methods.
Bilevel optimization can be approximately solved as a structured minimax problem.
Abstract
In this paper we study a class of unconstrained and constrained bilevel optimization problems in which the lower level is a possibly nonsmooth convex optimization problem, while the upper level is a possibly nonconvex optimization problem. We introduce a notion of -KKT solution for them and show that an -KKT solution leads to an - or -hypergradient based stionary point under suitable assumptions. We also propose first-order penalty methods for finding an -KKT solution of them, whose subproblems turn out to be a structured minimax problem and can be suitably solved by a first-order method recently developed by the authors. Under suitable assumptions, an \emph{operation complexity} of and , measured by their fundamental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research · Bone and Joint Diseases
