Gradient-free optimization of highly smooth functions: improved analysis   and a new algorithm

Arya Akhavan; Evgenii Chzhen; Massimiliano Pontil; and Alexandre B.; Tsybakov

arXiv:2306.02159·math.ST·June 6, 2023·J. Mach. Learn. Res.·5 cites

Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm

Arya Akhavan, Evgenii Chzhen, Massimiliano Pontil, and Alexandre B., Tsybakov

PDF

Open Access

TL;DR

This paper introduces a new zero-order optimization algorithm for highly smooth functions, providing improved theoretical convergence rates and analysis, especially under noisy conditions, with extensions to non-convex and special classes of functions.

Contribution

It presents a novel algorithm based on $ ext{l}_1$ sphere randomization, with improved analysis and bounds over existing methods, and introduces new proof techniques using Poincaré inequalities.

Findings

01

Improved convergence rates for highly smooth functions under noise.

02

The $ ext{l}_1$ sphere algorithm outperforms $ ext{l}_2$ sphere and Gaussian methods in bias and variance.

03

Minimax lower bounds show near optimality of the proposed bounds.

Abstract

This work studies minimization problems with zero-order noisy oracle information under the assumption that the objective function is highly smooth and possibly satisfies additional properties. We consider two kinds of zero-order projected gradient descent algorithms, which differ in the form of the gradient estimator. The first algorithm uses a gradient estimator based on randomization over the $ℓ_{2}$ sphere due to Bach and Perchet (2016). We present an improved analysis of this algorithm on the class of highly smooth and strongly convex functions studied in the prior work, and we derive rates of convergence for two more general classes of non-convex functions. Namely, we consider highly smooth functions satisfying the Polyak-{\L}ojasiewicz condition and the class of highly smooth functions with no additional property. The second algorithm is based on randomization over the $ℓ_{1}$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Risk and Portfolio Optimization