Optimal Local Convergence Rates of Stochastic First-Order Methods under Local $\alpha$-PL
Saeed Masiha, Saber Salehkaleybar, Niao He, Negar Kiyavash, and Patrick Thiran

TL;DR
This paper establishes the optimal local convergence rates of stochastic first-order methods under a local -PL condition, providing matching lower and upper bounds that depend on the parameter , with implications for both non-convex and convex optimization.
Contribution
The paper derives tight bounds for stochastic first-order methods under a local -PL condition, extending understanding of convergence rates across different regimes.
Findings
Lower bound of (\u03b5)^{-2/} for all stochastic first-order methods.
Matching upper bound achieved by a SARAH-type variance-reduced method.
Complexity bounds in the convex setting under local -PL condition.
Abstract
We study the local convergence rate of stochastic first-order methods under a local -Polyak-Lojasiewicz (-PL) condition in a neighborhood of a target connected component of the local minimizer set. The parameter is the exponent of the gradient norm in the -PL inequality: recovers the classical PL case, corresponds to Holder-type error bounds, and intermediate values interpolate between these regimes. Our performance criterion is the number of oracle queries required to output with , where for any . We work in a local regime where the algorithm is initialized near and, with high probability, its iterates remain in that neighborhood. We establish a lower bound for all stochastic first-order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Advanced Memory and Neural Computing
