Optimal Algorithms for Stochastic Bilevel Optimization under Relaxed Smoothness Conditions
Xuxing Chen, Tesi Xiao, Krishnakumar Balasubramanian

TL;DR
This paper introduces a new single-loop, Hessian-inversion-free algorithm for stochastic bilevel optimization that operates under standard smoothness assumptions, improving theoretical guarantees and practical performance.
Contribution
It presents a novel algorithmic framework that relaxes smoothness assumptions and extends to multi-objective bilevel problems, with tighter analysis and state-of-the-art complexity results.
Findings
Achieves optimal sample complexity matching single-level bounds.
Handles more general multi-objective bilevel problems.
Demonstrates superior performance in numerical experiments.
Abstract
Stochastic Bilevel optimization usually involves minimizing an upper-level (UL) function that is dependent on the arg-min of a strongly-convex lower-level (LL) function. Several algorithms utilize Neumann series to approximate certain matrix inverses involved in estimating the implicit gradient of the UL function (hypergradient). The state-of-the-art StOchastic Bilevel Algorithm (SOBA) [16] instead uses stochastic gradient descent steps to solve the linear system associated with the explicit matrix inversion. This modification enables SOBA to match the lower bound of sample complexity for the single-level counterpart in non-convex settings. Unfortunately, the current analysis of SOBA relies on the assumption of higher-order smoothness for the UL and LL functions to achieve optimality. In this paper, we introduce a novel fully single-loop and Hessian-inversion-free algorithmic framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Risk and Portfolio Optimization
