Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

Ashok Cutkosky; Harsh Mehta; Francesco Orabona

arXiv:2302.03775·cs.LG·August 8, 2025·6 cites

Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

Ashok Cutkosky, Harsh Mehta, Francesco Orabona

PDF

Open Access 1 Video

TL;DR

This paper introduces new algorithms for non-smooth, non-convex stochastic optimization that improve complexity bounds by leveraging a reduction to online learning, achieving optimal or near-optimal results.

Contribution

The paper presents a novel reduction technique from non-smooth non-convex optimization to online learning, leading to improved complexity bounds and unifying existing results.

Findings

01

Reduced complexity for finding $(oldsymbol{ ext{δ}},oldsymbol{ ext{ε}})$-stationary points to $O( ext{ε}^{-3} ext{δ}^{-1})$

02

Achieved a complexity of $O( ext{ε}^{-1.5} ext{δ}^{-0.5})$ for smooth objectives using optimistic online learning

03

Unified and recovered optimal results for smooth and second-order smooth objectives in stochastic and deterministic settings.

Abstract

We present new algorithms for optimizing non-smooth, non-convex stochastic objectives based on a novel analysis technique. This improves the current best-known complexity for finding a $(δ, ϵ)$ -stationary point from $O (ϵ^{- 4} δ^{- 1})$ stochastic gradient queries to $O (ϵ^{- 3} δ^{- 1})$ , which we also show to be optimal. Our primary technique is a reduction from non-smooth non-convex optimization to online learning, after which our results follow from standard regret bounds in online learning. For deterministic and second-order smooth objectives, applying more advanced optimistic online learning techniques enables a new complexity of $O (ϵ^{- 1.5} δ^{- 0.5})$ . Our techniques also recover all optimal or best-known results for finding $ϵ$ stationary points of smooth or second-order smooth objectives in both stochastic and deterministic settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research