Zeroth order optimization with orthogonal random directions
David Kozak, Cesare Molinari, Lorenzo Rosasco, Luis Tenorio, Silvia, Villa

TL;DR
This paper introduces a new zeroth-order optimization method using orthogonal random directions, providing convergence guarantees and rates for convex and certain non-convex objectives, with theoretical and experimental validation.
Contribution
It presents a novel orthogonal random directions approach for zeroth-order optimization, unifying and extending existing methods with proven convergence guarantees.
Findings
Convergence guarantees for convex objectives.
Convergence rates under different assumptions.
Numerical experiments validating theoretical results.
Abstract
We propose and analyze a randomized zeroth-order approach based on approximating the exact gradient byfinite differences computed in a set of orthogonal random directions that changes with each iteration. A number ofpreviously proposed methods are recovered as special cases including spherical smoothing, coordinate descent, as wellas discretized gradient descent. Our main contribution is proving convergence guarantees as well as convergence ratesunder different parameter choices and assumptions. In particular, we consider convex objectives, but also possiblynon-convex objectives satisfying the Polyak-{\L}ojasiewicz (PL) condition. Theoretical results are complemented andillustrated by numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
