Stochastic gradient-free descents
Xiaopeng Luo, Xin Xu

TL;DR
This paper introduces stochastic gradient-free and accelerated methods with momentum for stochastic optimization, analyzing their convergence behavior and demonstrating their effectiveness in both convex and nonconvex settings.
Contribution
It proposes novel stochastic gradient-free algorithms with momentum, providing theoretical convergence analysis and showing they achieve optimal rates under various conditions.
Findings
Gradient-free methods maintain sublinear convergence with decaying stepsize.
Accelerated methods with momentum achieve faster convergence rates.
All methods converge to stationary points in nonconvex scenarios.
Abstract
In this paper we propose stochastic gradient-free methods and accelerated methods with momentum for solving stochastic optimization problems. All these methods rely on stochastic directions rather than stochastic gradients. We analyze the convergence behavior of these methods under the mean-variance framework, and also provide a theoretical analysis about the inclusion of momentum in stochastic settings which reveals that the momentum term we used adds a deviation of order but controls the variance at the order for the th iteration. So it is shown that, when employing a decaying stepsize , the stochastic gradient-free methods can still maintain the sublinear convergence rate and the accelerated methods with momentum can achieve a convergence rate in probability for the strongly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Advanced Bandit Algorithms Research
