Zeroth order optimization with orthogonal random directions

David Kozak; Cesare Molinari; Lorenzo Rosasco; Luis Tenorio; Silvia; Villa

arXiv:2107.03941·math.OC·November 16, 2021

Zeroth order optimization with orthogonal random directions

David Kozak, Cesare Molinari, Lorenzo Rosasco, Luis Tenorio, Silvia, Villa

PDF

Open Access

TL;DR

This paper introduces a new zeroth-order optimization method using orthogonal random directions, providing convergence guarantees and rates for convex and certain non-convex objectives, with theoretical and experimental validation.

Contribution

It presents a novel orthogonal random directions approach for zeroth-order optimization, unifying and extending existing methods with proven convergence guarantees.

Findings

01

Convergence guarantees for convex objectives.

02

Convergence rates under different assumptions.

03

Numerical experiments validating theoretical results.

Abstract

We propose and analyze a randomized zeroth-order approach based on approximating the exact gradient byfinite differences computed in a set of orthogonal random directions that changes with each iteration. A number ofpreviously proposed methods are recovered as special cases including spherical smoothing, coordinate descent, as wellas discretized gradient descent. Our main contribution is proving convergence guarantees as well as convergence ratesunder different parameter choices and assumptions. In particular, we consider convex objectives, but also possiblynon-convex objectives satisfying the Polyak-{\L}ojasiewicz (PL) condition. Theoretical results are complemented andillustrated by numerical experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and ELM