Alternating Regret for Online Convex Optimization

Soumita Hait; Ping Li; Haipeng Luo; Mengxiao Zhang

arXiv:2502.12529·cs.LG·June 19, 2025

Alternating Regret for Online Convex Optimization

Soumita Hait, Ping Li, Haipeng Luo, Mengxiao Zhang

PDF

Open Access

TL;DR

This paper demonstrates that the continuous Hedge algorithm achieves sublinear alternating regret in online convex optimization, enabling equilibrium convergence in two-player games, and introduces improved algorithms with better regret bounds for smooth losses.

Contribution

It extends alternating regret bounds from linear to general convex problems and proposes algorithms with improved regret rates for smooth and self-concordant losses.

Findings

01

Continuous Hedge achieves $ ilde{O}(d^{2/3}T^{1/3})$ regret in OCO.

02

Algorithms find Nash or correlated equilibria at $ ilde{O}(d^{2/3}/T^{2/3})$ rate.

03

New algorithms attain $ ilde{O}(T^{2/5})$ regret without dimension dependence.

Abstract

Motivated by alternating learning dynamics in two-player games, a recent work by Cevher et al.(2024) shows that $o (T)$ alternating regret is possible for any $T$ -round adversarial Online Linear Optimization (OLO) problem, and left as an open question whether the same is true for general Online Convex Optimization (OCO). We answer this question in the affirmative by showing that the continuous Hedge algorithm achieves $\tilde{O} (d^{\frac{2}{3}} T^{\frac{1}{3}})$ alternating regret for any adversarial $d$ -dimensional OCO problems. We show that this implies an alternating learning dynamic that finds a Nash equilibrium for any convex-concave zero-sum games or a coarse correlated equilibrium for any convex two-player general-sum games at a rate of $\tilde{O} (d^{\frac{2}{3}} / T^{\frac{2}{3}})$ . To further improve the time complexity and/or the dimension dependence, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Wireless Network Optimization · Smart Parking Systems Research