The Hidden Cost of Approximation in Online Mirror Descent

Ofir Schlisselberg; Uri Sherman; Tomer Koren; Yishay Mansour

arXiv:2511.22283·cs.LG·December 1, 2025

The Hidden Cost of Approximation in Online Mirror Descent

Ofir Schlisselberg, Uri Sherman, Tomer Koren, Yishay Mansour

PDF

Open Access

TL;DR

This paper investigates the impact of approximation errors in online mirror descent, revealing how regularizer smoothness influences robustness and providing bounds on regret under various regularizers and stochastic settings.

Contribution

It systematically analyzes inexact OMD, establishing bounds on regret related to regularizer smoothness and identifying conditions under which different regularizers are robust to errors.

Findings

01

Uniformly smooth regularizers have tight bounds on excess regret due to errors.

02

Negative entropy requires exponentially small errors to prevent linear regret.

03

Log-barrier and Tsallis regularizers are robust even with polynomial errors.

Abstract

Online mirror descent (OMD) is a fundamental algorithmic paradigm that underlies many algorithms in optimization, machine learning and sequential decision-making. The OMD iterates are defined as solutions to optimization subproblems which, oftentimes, can be solved only approximately, leading to an inexact version of the algorithm. Nonetheless, existing OMD analyses typically assume an idealized error free setting, thereby limiting our understanding of performance guarantees that should be expected in practice. In this work we initiate a systematic study into inexact OMD, and uncover an intricate relation between regularizer smoothness and robustness to approximation errors. When the regularizer is uniformly smooth, we establish a tight bound on the excess regret due to errors. Then, for barrier regularizers over the simplex and its subsets, we identify a sharp separation: negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Mobile Crowdsensing and Crowdsourcing