A Parameter-Free Stochastic LineseArch Method (SLAM) for Minimizing Expectation Residuals
Qi Wang, Uday V. Shanbhag, Yue Xie

TL;DR
This paper introduces a parameter-free stochastic line search method that adaptively determines step sizes without knowing problem parameters, achieving optimal convergence rates for smooth and nonsmooth convex optimization.
Contribution
It proposes a novel Armijo-enabled stochastic linesearch framework that removes the need for prior knowledge of Lipschitz constants, with proven convergence guarantees.
Findings
Expected stationarity residual decreases at rate O(1/√K)
Iteration complexity is O(ε^{-2}) for ε-stationary points
Sample complexity is O(ε^{-4}) for ε-stationary points
Abstract
Most existing rate and complexity guarantees for stochastic gradient methods in -smooth settings mandates that such sequences be non-adaptive, non-increasing, and upper bounded by for . This requires knowledge of and may preclude larger steps. Motivated by these shortcomings, we present an Armijo-enabled stochastic linesearch framework with standard stochastic zeroth- and first-order oracles. The resulting steplength sequence is non-monotone and requires neither knowledge of nor any other problem parameters. We then prove that the expected stationarity residual diminishes at a rate of , where denotes the iteration budget. Furthermore, the resulting iteration and sample complexities for computing an -stationary point are and . The proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Stochastic processes and financial applications
