Small errors in random zeroth-order optimization are imaginary
Wouter Jongeneel, Man-Chung Yue, Daniel Kuhn

TL;DR
This paper introduces a novel complex-domain gradient estimator for zeroth-order optimization that requires only one function evaluation, maintains bounded variance as the smoothing parameter shrinks, and improves practical convergence.
Contribution
The authors develop a complex-domain gradient estimator that is immune to numerical cancellation and maintains bounded variance, enhancing zeroth-order optimization algorithms.
Findings
Estimator requires only one function evaluation.
Variance remains bounded as the smoothing parameter approaches zero.
Algorithms using this estimator converge faster in practice.
Abstract
Most zeroth-order optimization algorithms mimic a first-order algorithm but replace the gradient of the objective function with some gradient estimator that can be computed from a small number of function evaluations. This estimator is constructed randomly, and its expectation matches the gradient of a smooth approximation of the objective function whose quality improves as the underlying smoothing parameter is reduced. Gradient estimators requiring a smaller number of function evaluations are preferable from a computational point of view. While estimators based on a single function evaluation can be obtained by use of the divergence theorem from vector calculus, their variance explodes as tends to . Estimators based on multiple function evaluations, on the other hand, suffer from numerical cancellation when tends to . To combat both effects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Advanced Optimization Algorithms Research · Fractional Differential Equations Solutions
