Accelerating Inexact HyperGradient Descent for Bilevel Optimization
Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

TL;DR
This paper introduces RAHGD, a novel accelerated method for nonconvex-strongly-convex bilevel optimization, achieving state-of-the-art theoretical guarantees for finding stationary points with improved oracle complexity.
Contribution
The paper proposes RAHGD, an accelerated hypergradient descent algorithm with optimal complexity bounds for bilevel problems, and extends it to find second-order stationary points.
Findings
Achieves the best-known complexity for stationary point convergence in bilevel optimization.
Improves upper complexity bounds for nonconvex-strongly-concave minimax problems.
Empirical results validate the theoretical improvements.
Abstract
We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an -first-order stationary point of the objective with oracle complexity, where is the condition number of the lower-level objective and is the desired accuracy. We also propose a perturbed variant of \texttt{RAHGD} for finding an -second-order stationary point within the same order of oracle complexity. Our results achieve the best-known theoretical guarantees for finding stationary points in bilevel optimization and also improve upon the existing upper complexity bound for finding second-order stationary points in nonconvex-strongly-concave…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Drug Transport and Resistance Mechanisms · Pediatric Hepatobiliary Diseases and Treatments
