Accelerated zero-order SGD under high-order smoothness and overparameterized regime
Georgii Bychkov, Darina Dvinskikh, Anastasia Antsiferova, Alexander, Gasnikov, Aleksandr Lobanov

TL;DR
This paper introduces a gradient-free optimization algorithm that leverages high-order smoothness and overparameterization to improve performance in black-box convex stochastic problems with noisy feedback.
Contribution
It develops an accelerated zero-order stochastic gradient descent method that exploits high-order smoothness and overparameterization, providing optimal oracle complexity and convergence guarantees.
Findings
Algorithm achieves optimal oracle complexity.
Convergence guarantees under adversarial noise.
Effective in high-dimensional, overparameterized models.
Abstract
We present a novel gradient-free algorithm to solve a convex stochastic optimization problem, such as those encountered in medicine, physics, and machine learning (e.g., adversarial multi-armed bandit problem), where the objective function can only be computed through numerical simulation, either as the result of a real experiment or as feedback given by the function evaluations from an adversary. Thus we suppose that only a black-box access to the function values of the objective is available, possibly corrupted by adversarial noise: deterministic or stochastic. The noisy setup can arise naturally from modeling randomness within a simulation or by computer discretization, or when exact values of function are forbidden due to privacy issues, or when solving non-convex problems as convex ones with an inexact function oracle. By exploiting higher-order smoothness, fulfilled, e.g., in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
MethodsLogistic Regression
