Power Homotopy for Zeroth-Order Non-Convex Optimizations
Chen Xu

TL;DR
This paper introduces GS-PowerHP, a new zeroth-order optimization method for non-convex problems that uses a power-transformed Gaussian surrogate and decaying smoothing parameter, achieving strong theoretical guarantees and empirical performance.
Contribution
The paper presents GS-PowerHP, a novel zeroth-order method combining power-transformed Gaussian smoothing with decaying variance, providing convergence guarantees and state-of-the-art empirical results.
Findings
Converges to a neighborhood of the global maximizer with $O(d^2 \\varepsilon^{-2})$ complexity.
Ranks among top three algorithms on benchmark tests.
Achieves first place in high-dimensional black-box attack experiments.
Abstract
We introduce GS-PowerHP, a novel zeroth-order method for non-convex optimization problems of the form . Our approach leverages two key components: a power-transformed Gaussian-smoothed surrogate whose stationary points cluster near the global maximizer of for sufficiently large , and an incrementally decaying for enhanced data efficiency. Under mild assumptions, we prove convergence in expectation to a small neighborhood of with the iteration complexity of . Empirical results show our approach consistently ranks among the top three across a suite of competing algorithms. Its robustness is underscored by the final experiment on a substantially high-dimensional problem (), where it achieved first place on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques
