On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

Shaocong Ma; Heng Huang

arXiv:2510.19953·cs.LG·October 24, 2025

On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

Shaocong Ma, Heng Huang

PDF

Open Access

TL;DR

This paper introduces a new family of unbiased gradient estimators for zeroth-order optimization that use only function evaluations, improving accuracy and convergence in stochastic settings.

Contribution

It proposes a novel unbiased estimator based on function evaluations, reformulating directional derivatives as a telescoping series and optimizing sampling distributions.

Findings

01

Achieves unbiasedness with favorable variance properties.

02

Proves optimal complexity for smooth non-convex objectives.

03

Demonstrates superior accuracy and convergence in experiments.

Abstract

Zeroth-order optimization (ZOO) is an important framework for stochastic optimization when gradients are unavailable or expensive to compute. A potential limitation of existing ZOO methods is the bias inherent in most gradient estimators unless the perturbation stepsize vanishes. In this paper, we overcome this biasedness issue by proposing a novel family of unbiased gradient estimators based solely on function evaluations. By reformulating directional derivatives as a telescoping series and sampling from carefully designed distributions, we construct estimators that eliminate bias while maintaining favorable variance. We analyze their theoretical properties, derive optimal scaling distributions and perturbation stepsizes of four specific constructions, and prove that SGD using the proposed estimators achieves optimal complexity for smooth non-convex objectives. Experiments on synthetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Multi-Objective Optimization Algorithms · Model Reduction and Neural Networks