On the Convergence and Complexity of the Stochastic Central   Finite-Difference Based Gradient Estimation Methods

Raghu Bollapragada; Cem Karamanli

arXiv:2501.06610·math.OC·January 14, 2025

On the Convergence and Complexity of the Stochastic Central Finite-Difference Based Gradient Estimation Methods

Raghu Bollapragada, Cem Karamanli

PDF

TL;DR

This paper introduces a framework for stochastic optimization using finite-difference gradient estimates, providing theoretical convergence guarantees and demonstrating effectiveness through numerical experiments.

Contribution

It develops a new framework that adaptively controls finite-difference gradient accuracy and proves optimal complexity bounds for nonconvex stochastic optimization.

Findings

01

Achieves sublinear convergence to a neighborhood of the solution.

02

Establishes optimal iteration and sample complexity bounds.

03

Demonstrates effectiveness through numerical experiments on nonlinear least squares.

Abstract

This paper presents an algorithmic framework for solving unconstrained stochastic optimization problems using only stochastic function evaluations. We employ central finite-difference based gradient estimation methods to approximate the gradients and dynamically control the accuracy of these approximations by adjusting the sample sizes used in stochastic realizations. We analyze the theoretical properties of the proposed framework on nonconvex functions. Our analysis yields sublinear convergence results to the neighborhood of the solution, and establishes the optimal worst-case iteration complexity ( $O (ϵ^{- 1})$ ) and sample complexity ( $O (ϵ^{- 2})$ ) for each gradient estimation method to achieve an $ϵ$ -accurate solution. Finally, we demonstrate the performance of the proposed framework and the quality of the gradient estimation methods through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.