Exploiting Curvature in Online Convex Optimization with Delayed Feedback
Hao Qiu, Emmanuel Esposito, Mengxiao Zhang

TL;DR
This paper introduces new algorithms for online convex optimization with delayed feedback, achieving improved regret bounds for strongly convex, exp-concave, and linear regression losses, and demonstrates their effectiveness through experiments.
Contribution
It proposes novel algorithms that attain better regret bounds in delayed feedback scenarios for various convex loss functions, including strongly convex, exp-concave, and linear regression cases.
Findings
Achieved regret bounds of order _{\u00a0}max _{ ext}T for strongly convex losses.
Extended Online Newton Step to handle delays with adaptive tuning, achieving regret _{ ext}max n _{ ext}T for exp-concave losses.
Designed a variant of the Vovk-Azoury-Warmuth forecaster with clipping for linear regression, with similar guarantees.
Abstract
In this work, we study the online convex optimization problem with curved losses and delayed feedback. When losses are strongly convex, existing approaches obtain regret bounds of order , where is the maximum delay and is the time horizon. However, in many cases, this guarantee can be much worse than as obtained by a delayed version of online gradient descent, where is the total delay. We bridge this gap by proposing a variant of follow-the-regularized-leader that obtains regret of order , where is the maximum number of missing observations. We then consider exp-concave losses and extend the Online Newton Step algorithm to handle delays with an adaptive learning rate tuning, achieving regret …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
