Exploiting Curvature in Online Convex Optimization with Delayed Feedback

Hao Qiu; Emmanuel Esposito; Mengxiao Zhang

arXiv:2506.07595 · cs.LG · June 10, 2025

TL;DR

This paper introduces new algorithms for online convex optimization with delayed feedback, achieving improved regret bounds for strongly convex, exp-concave, and linear regression losses, and demonstrates their effectiveness through experiments.

Contribution

The paper proposes algorithms for delayed feedback whose regret bounds improve on existing ones for strongly convex losses, exp-concave losses, and online linear regression.
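To make the delayed feedback setting concrete, here is a minimal sketch of the interaction loop, using plain online gradient descent in one dimension as a stand-in learner. It is not one of the paper's algorithms. The function name `delayed_ogd_1d`, the constant step size `eta`, the interval constraint `[-radius, radius]`, and the convention that feedback for round `s` arrives at the end of round `s + delays[s]` are all assumptions made for illustration.

```python
def delayed_ogd_1d(grad_fns, delays, x0=0.0, eta=0.5, radius=1.0):
    """Online gradient descent on [-radius, radius] with delayed gradients.

    Assumed convention: the gradient of round s, evaluated at the point
    played in round s, becomes available at the end of round s + delays[s].
    """
    T = len(grad_fns)
    # arrivals[t] lists the rounds whose feedback arrives at the end of round t.
    arrivals = [[] for _ in range(T)]
    for s, d in enumerate(delays):
        if s + d < T:  # feedback arriving after the horizon is never used
            arrivals[s + d].append(s)

    x = x0
    plays = []
    pending = {}  # gradients computed but not yet revealed to the learner
    for t in range(T):
        plays.append(x)
        pending[t] = grad_fns[t](x)
        # Apply every gradient that arrives at the end of this round.
        for s in arrivals[t]:
            x -= eta * pending.pop(s)
        # Project back onto the feasible interval.
        x = max(-radius, min(radius, x))
    return plays


# Quadratic loss f(x) = 0.5 * (x - 1)^2 in every round, gradient x - 1.
grad = lambda x: x - 1.0
no_delay = delayed_ogd_1d([grad] * 3, [0, 0, 0], radius=2.0)
one_step_delay = delayed_ogd_1d([grad] * 3, [1, 1, 1], radius=2.0)
```

With every delay set to 1, the second play equals the first, because no feedback has arrived by then; without delays the learner already moves after the first round.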

Findings

01

Achieved regret bounds of order $\sigma_{\max} \ln T$ for strongly convex losses.

02

Extended Online Newton Step to handle delays with adaptive tuning, achieving regret of order $d_{\max} n \ln T$ for exp-concave losses.

03

Designed a variant of the Vovk-Azoury-Warmuth forecaster with clipping for linear regression, with similar guarantees.

Abstract

In this work, we study the online convex optimization problem with curved losses and delayed feedback. When losses are strongly convex, existing approaches obtain regret bounds of order $d_{\max} \ln T$, where $d_{\max}$ is the maximum delay and $T$ is the time horizon. However, in many cases, this guarantee can be much worse than $\sqrt{d_{\mathrm{tot}}}$ as obtained by a delayed version of online gradient descent, where $d_{\mathrm{tot}}$ is the total delay. We bridge this gap by proposing a variant of follow-the-regularized-leader that obtains regret of order $\min\{\sigma_{\max} \ln T, \sqrt{d_{\mathrm{tot}}}\}$, where $\sigma_{\max}$ is the maximum number of missing observations. We then consider exp-concave losses and extend the Online Newton Step algorithm to handle delays with an adaptive learning rate tuning, achieving regret $\min\{d_{\max} n \ln T, \sqrt{d_{\mathrm{tot}}}\}$ …
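The three delay quantities in the abstract can be computed from a delay sequence as in the sketch below. The function name `delay_statistics` is hypothetical, and the convention is an assumption: feedback for round `s` arrives at the end of round `s + delays[s]`, so an observation counts as missing in round `t` if it comes from an earlier round `s < t` with `s + delays[s] >= t`.

```python
def delay_statistics(delays):
    """Return (d_max, d_tot, sigma_max) for a sequence of per-round delays.

    Assumed convention: feedback for round s arrives at the end of round
    s + delays[s], so it is still missing at the start of every round t
    with s < t <= s + delays[s].
    """
    T = len(delays)
    d_max = max(delays, default=0)  # maximum delay
    d_tot = sum(delays)             # total delay
    # Number of missing observations at the start of each round.
    sigma = [
        sum(1 for s in range(t) if s + delays[s] >= t)
        for t in range(T)
    ]
    sigma_max = max(sigma, default=0)
    return d_max, d_tot, sigma_max


# One heavily delayed round: d_max is large, but only one
# observation is ever missing at a time.
single_late = delay_statistics([5, 0, 0, 0, 0, 0])
# Several overlapping delays.
overlapping = delay_statistics([2, 2, 2, 0, 0])
```

In the first example the maximum delay is 5 while at most one observation is missing in any round, which illustrates how a bound stated in terms of $\sigma_{\max}$ can be much smaller than one stated in terms of $d_{\max}$.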

Taxonomy

Topics: Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques