An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with   Two-Point Feedback

Ohad Shamir

arXiv:1507.08752·cs.LG·August 3, 2015·23 cites

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

Ohad Shamir

PDF

Open Access

TL;DR

This paper introduces a simple, optimal algorithm for bandit and zero-order convex optimization with two-point feedback, applicable to Lipschitz functions and extendable to non-Euclidean settings.

Contribution

It presents a new, simpler algorithm with optimal performance for convex Lipschitz functions, improving upon previous methods limited to smooth functions.

Findings

01

Algorithm is optimal for convex Lipschitz functions.

02

Simpler analysis and extension to non-Euclidean problems.

03

Improves upon previous methods limited to smooth functions.

Abstract

We consider the closely related problems of bandit convex optimization with two-point feedback, and zero-order stochastic convex optimization with two function evaluations per round. We provide a simple algorithm and analysis which is optimal for convex Lipschitz functions. This improves on \cite{dujww13}, which only provides an optimal result for smooth functions; Moreover, the algorithm and analysis are simpler, and readily extend to non-Euclidean problems. The algorithm is based on a small but surprisingly powerful modification of the gradient estimator.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques