Improved Regret Guarantees for Online Mirror Descent using a Portfolio of Mirror Maps
Swati Gupta, Jai Moondra, Mohit Singh

TL;DR
This paper demonstrates that selecting appropriate mirror maps, especially block norm-based ones, can significantly improve regret bounds in online convex optimization, particularly for sparse loss functions, through adaptive algorithms.
Contribution
It introduces a family of mirror maps based on block norms that adapt better to sparsity, and proposes a meta-algorithm to dynamically select geometries, achieving polynomial regret improvements.
Findings
Block norm-based mirror maps outperform traditional $L_1$ and $L_2$ geometries.
Naive switching between geometries can cause linear regret.
Meta-algorithm effectively adapts to unknown sparsity levels, improving regret guarantees.
Abstract
OMD and its variants give a flexible framework for OCO where the performance depends crucially on the choice of the mirror map. While the geometries underlying OPGD and OEG, both special cases of OMD, are well understood, it remains a challenging open question on how to construct an optimal mirror map for any given constrained set and a general family of loss functions, e.g., sparse losses. Motivated by parameterizing a near-optimal set of mirror maps, we consider a simpler question: is it even possible to obtain polynomial gains in regret by using mirror maps for geometries that interpolate between and , which may not be possible by restricting to only OEG () or OPGD (). Our main result answers this question positively. We show that mirror maps based on block norms adapt better to the sparsity of loss functions, compared to previous (for )…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Risk and Portfolio Optimization
