Zeroth-order methods for noisy H\"older-gradient functions
Innokentiy Shibaev, Pavel Dvurechensky, Alexander Gasnikov

TL;DR
This paper extends complexity bounds for zeroth-order optimization methods to non-convex functions with H"older-continuous gradients under noisy observations, providing convergence guarantees and noise level bounds.
Contribution
It generalizes Nesterov and Spokoiny's Gaussian smoothing approach to H"older-gradient functions with noise, deriving new convergence and noise bounds.
Findings
Convergence to stationary points under noisy zeroth-order oracle.
Bounds on noise levels for guaranteed convergence.
Improved dimension dependence for the case of ta=1.
Abstract
In this paper, we prove new complexity bounds for zeroth-order methods in non-convex optimization with inexact observations of the objective function values. We use the Gaussian smoothing approach of Nesterov and Spokoiny [2015] and extend their results, obtained for optimization methods for smooth zeroth-order non-convex problems, to the setting of minimization of functions with H\"older-continuous gradient with noisy zeroth-order oracle, obtaining noise upper-bounds as well. We consider finite-difference gradient approximation based on normally distributed random Gaussian vectors and prove that gradient descent scheme based on this approximation converges to the stationary point of the smoothed function. We also consider convergence to the stationary point of the original (not smoothed) function and obtain bounds on the number of steps of the algorithm for making the norm of its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
