A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex   Stochastic Optimization

Spyridon Pougkakiotis; Dionysios S. Kalogerias

arXiv:2205.01633·math.OC·April 21, 2025·SIAM J. Sci. Comput.

A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization

Spyridon Pougkakiotis, Dionysios S. Kalogerias

PDF

Open Access 1 Repo

TL;DR

This paper introduces a zeroth-order proximal stochastic gradient method for weakly convex stochastic optimization, enabling optimization without gradient information and demonstrating effectiveness in hyper-parameter tuning and PDE-constrained problems.

Contribution

It proposes a novel zeroth-order proximal stochastic gradient algorithm using Gaussian smoothing for weakly convex problems, with proven convergence rates and practical applications.

Findings

01

Achieves state-of-the-art convergence rates for zeroth-order methods.

02

Effectively tunes hyper-parameters automatically in complex optimization tasks.

03

Demonstrates empirical success on phase retrieval and PDE-constrained problems.

Abstract

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which (sub-)gradient information might be unavailable. The proposed algorithm utilizes the well-known Gaussian smoothing technique, which yields unbiased zeroth-order gradient estimators of a related partially smooth surrogate problem (in which one of the two nonsmooth terms in the original problem's objective is replaced by a smooth approximation). This allows us to employ a standard proximal stochastic gradient scheme for the approximate solution of the surrogate problem, which is determined by a single smoothing parameter, and without the utilization of first-order information. We provide state-of-the-art convergence rates for the proposed zeroth-order method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spougkakiotis/z-proxsg
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning