Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability

Yu-Jie Zhang; Peng Zhao; Masashi Sugiyama

arXiv:2506.10616·cs.LG·June 13, 2025

Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability

Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama

PDF

Open Access

TL;DR

This paper advances non-stationary online learning by leveraging mixability to achieve significantly improved dynamic regret bounds for curved losses like squared and logistic, surpassing prior convex-focused results.

Contribution

It introduces a novel analysis framework that exploits mixability to improve dynamic regret bounds for curved losses in non-stationary environments.

Findings

01

Achieves $ ilde{O}(d T^{1/3} P_T^{2/3})$ dynamic regret for mixable losses.

02

Improves upon previous regret bounds that depended more heavily on dimensionality.

03

Provides a simple analytical approach avoiding complex KKT-based analysis.

Abstract

Non-stationary online learning has drawn much attention in recent years. Despite considerable progress, dynamic regret minimization has primarily focused on convex functions, leaving the functions with stronger curvature (e.g., squared or logistic loss) underexplored. In this work, we address this gap by showing that the regret can be substantially improved by leveraging the concept of mixability, a property that generalizes exp-concavity to effectively capture loss curvature. Let $d$ denote the dimensionality and $P_{T}$ the path length of comparators that reflects the environmental non-stationarity. We demonstrate that an exponential-weight method with fixed-share updates achieves an $O (d T^{1/3} P_{T}^{2/3} lo g T)$ dynamic regret for mixable losses, improving upon the best-known $O (d^{10/3} T^{1/3} P_{T}^{2/3} lo g T)$ result (Baby and Wang, 2021) in $d$ . More…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques

MethodsSoftmax · Attention Is All You Need