Online Learning in Dynamically Changing Environments

Changlong Wu; Ananth Grama; Wojciech Szpankowski

arXiv:2302.00103·cs.LG·November 14, 2023

Online Learning in Dynamically Changing Environments

Changlong Wu, Ananth Grama, Wojciech Szpankowski

PDF

Open Access

TL;DR

This paper investigates online learning in non-stationary environments, providing tight regret bounds that depend on the number of process changes and the complexity of the hypothesis class, advancing understanding of learning under changing data distributions.

Contribution

It introduces a novel framework for analyzing online learning with non-stationary data, establishing tight regret bounds that depend on process change complexity and hypothesis class VC dimension.

Findings

01

Established tight regret bounds for non-stationary processes with bounded changes.

02

Extended results to general mixable losses with improved bounds.

03

Demonstrated sub-linear regret for smooth adversary processes with threshold functions.

Abstract

We study the problem of online learning and online regret minimization when samples are drawn from a general unknown non-stationary process. We introduce the concept of a dynamic changing process with cost $K$ , where the conditional marginals of the process can vary arbitrarily, but that the number of different conditional marginals is bounded by $K$ over $T$ rounds. For such processes we prove a tight (upto $lo g T$ factor) bound $O (K T \cdot VC (H) lo g T)$ for the expected worst case regret of any finite VC-dimensional class $H$ under absolute loss (i.e., the expected miss-classification loss). We then improve this bound for general mixable losses, by establishing a tight (up to $lo g^{3} T$ factor) regret bound $O (K \cdot VC (H) lo g^{3} T)$ . We extend these results to general smooth adversary processes with unknown reference measure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems