Stochastic gradient-free descents

Xiaopeng Luo; Xin Xu

arXiv:1912.13305·math.OC·January 15, 2020·1 cites

Stochastic gradient-free descents

Xiaopeng Luo, Xin Xu

PDF

Open Access

TL;DR

This paper introduces stochastic gradient-free and accelerated methods with momentum for stochastic optimization, analyzing their convergence behavior and demonstrating their effectiveness in both convex and nonconvex settings.

Contribution

It proposes novel stochastic gradient-free algorithms with momentum, providing theoretical convergence analysis and showing they achieve optimal rates under various conditions.

Findings

01

Gradient-free methods maintain sublinear convergence with decaying stepsize.

02

Accelerated methods with momentum achieve faster convergence rates.

03

All methods converge to stationary points in nonconvex scenarios.

Abstract

In this paper we propose stochastic gradient-free methods and accelerated methods with momentum for solving stochastic optimization problems. All these methods rely on stochastic directions rather than stochastic gradients. We analyze the convergence behavior of these methods under the mean-variance framework, and also provide a theoretical analysis about the inclusion of momentum in stochastic settings which reveals that the momentum term we used adds a deviation of order $O (1/ k)$ but controls the variance at the order $O (1/ k)$ for the $k$ th iteration. So it is shown that, when employing a decaying stepsize $α_{k} = O (1/ k)$ , the stochastic gradient-free methods can still maintain the sublinear convergence rate $O (1/ k)$ and the accelerated methods with momentum can achieve a convergence rate $O (1/ k^{2})$ in probability for the strongly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Advanced Bandit Algorithms Research