Nonconvex Zeroth-Order Stochastic ADMM Methods with Lower Function Query Complexity
Feihu Huang, Shangqian Gao, Jian Pei, Heng Huang

TL;DR
This paper introduces faster zeroth-order stochastic ADMM algorithms for nonconvex optimization, significantly reducing function query complexity and effectively handling complex penalties and constraints in machine learning tasks.
Contribution
It proposes ZO-SPIDER-ADMM and ZOO-ADMM+ methods with lower query complexities, improving efficiency over existing zeroth-order ADMM approaches for nonconvex problems.
Findings
Achieves lower function query complexity of O(nd+dn^{1/2}ε^{-1})
Improves existing methods by a factor of O(d^{1/3}n^{1/6})
Demonstrates efficiency through experiments on black-box neural network attacks
Abstract
Zeroth-order (a.k.a, derivative-free) methods are a class of effective optimization methods for solving complex machine learning problems, where gradients of the objective functions are not available or computationally prohibitive. Recently, although many zeroth-order methods have been developed, these approaches still have two main drawbacks: 1) high function query complexity; 2) not being well suitable for solving the problems with complex penalties and constraints. To address these challenging drawbacks, in this paper, we propose a class of faster zeroth-order stochastic alternating direction method of multipliers (ADMM) methods (ZO-SPIDER-ADMM) to solve the nonconvex finite-sum problems with multiple nonsmooth penalties. Moreover, we prove that the ZO-SPIDER-ADMM methods can achieve a lower function query complexity of for finding an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Sparse and Compressive Sensing Techniques
MethodsAlternating Direction Method of Multipliers
