SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity
Tianshu Chu, Dachuan Xu, Wei Yao, Jin Zhang

TL;DR
This paper introduces SPABA, a novel stochastic bilevel optimization algorithm that achieves the same optimal sample complexity as single-level methods, resolving a key open question in large-scale nested optimization.
Contribution
The paper adapts the PAGE method to bilevel optimization, proving it attains optimal complexity bounds and demonstrating its superiority over other stochastic gradient estimators.
Findings
SPABA achieves optimal sample complexity in bilevel optimization.
Other stochastic methods like SGD and SAGA may not reach this optimal complexity.
Numerical experiments confirm the practical efficiency of the proposed algorithms.
Abstract
While stochastic bilevel optimization methods have been extensively studied for addressing large-scale nested optimization problems in machine learning, it remains an open question whether the optimal complexity bounds for solving bilevel optimization are the same as those in single-level optimization. Our main result resolves this question: SPABA, an adaptation of the PAGE method for nonconvex optimization in (Li et al., 2021) to the bilevel setting, can achieve optimal sample complexity in both the finite-sum and expectation settings. We show the optimality of SPABA by proving that there is no gap in complexity analysis between stochastic bilevel and single-level optimization when implementing PAGE. Notably, as indicated by the results of (Dagr\'eou et al., 2022), there might exist a gap in complexity analysis when implementing other stochastic gradient estimators, like SGD and SAGA.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference
MethodsSAGA · Stochastic Gradient Descent
