Zeroth-Order Randomized Subspace Newton Methods
Erik Berglund, Sarit Khirirat, Xiaoyu Wang

TL;DR
This paper introduces ZO-RSN, a zeroth-order randomized subspace Newton method that accelerates convergence for black-box optimization by estimating gradients and Hessians via random sketching, outperforming existing methods.
Contribution
The paper proposes a novel zeroth-order method that uses random sketching to approximate Newton steps in lower-dimensional subspaces, improving efficiency and convergence.
Findings
ZO-RSN achieves lower iteration complexity for strongly convex problems.
Numerical experiments demonstrate ZO-RSN's superior performance in black-box attacks.
ZO-RSN requires fewer function queries than state-of-the-art Hessian-aware zeroth-order methods.
Abstract
Zeroth-order methods have become important tools for solving problems where we have access only to function evaluations. However, the zeroth-order methods only using gradient approximations are times slower than classical first-order methods for solving n-dimensional problems. To accelerate the convergence rate, this paper proposes the zeroth order randomized subspace Newton (ZO-RSN) method, which estimates projections of the gradient and Hessian by random sketching and finite differences. This allows us to compute the Newton step in a lower dimensional subspace, with small computational costs. We prove that ZO-RSN can attain lower iteration complexity than existing zeroth order methods for strongly convex problems. Our numerical experiments show that ZO-RSN can perform black-box attacks under a more restrictive limit on the number of function queries than the state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
