A Structured Proximal Stochastic Variance Reduced Zeroth-order Algorithm
Marco Rando, Cheik Traor\'e, Cesare Molinari, Lorenzo Rosasco, Silvia Villa

TL;DR
This paper introduces a structured variance-reduced finite-difference algorithm for non-smooth finite-sum minimization, achieving improved convergence rates without gradient information and demonstrating strong practical performance.
Contribution
It proposes a novel structured variance-reduced finite-difference method for non-smooth optimization, analyzing its convergence and practical efficiency.
Findings
Achieves state-of-the-art convergence rates.
Lower per-iteration computational costs.
Strong empirical performance in experiments.
Abstract
Minimizing finite sums of functions is a central problem in optimization, arising in numerous practical applications. Such problems are commonly addressed using first-order optimization methods. However, these procedures cannot be used in settings where gradient information is unavailable. Finite-difference methods provide an alternative by approximating gradients through function evaluations along a set of directions. For finite-sum minimization problems, it was shown that incorporating variance-reduction techniques into finite-difference methods can improve convergence rates. Additionally, recent studies showed that imposing structure on the directions (e.g., orthogonality) enhances performance. However, the impact of structured directions on variance-reduced finite-difference methods remains unexplored. In this work, we close this gap by proposing a structured variance-reduced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Gaussian Processes and Bayesian Inference
