Online Learning in Dynamically Changing Environments
Changlong Wu, Ananth Grama, Wojciech Szpankowski

TL;DR
This paper investigates online learning in non-stationary environments, providing tight regret bounds that depend on the number of process changes and the complexity of the hypothesis class, advancing understanding of learning under changing data distributions.
Contribution
It introduces a novel framework for analyzing online learning with non-stationary data, establishing tight regret bounds that depend on process change complexity and hypothesis class VC dimension.
Findings
Established tight regret bounds for non-stationary processes with bounded changes.
Extended results to general mixable losses with improved bounds.
Demonstrated sub-linear regret for smooth adversary processes with threshold functions.
Abstract
We study the problem of online learning and online regret minimization when samples are drawn from a general unknown non-stationary process. We introduce the concept of a dynamic changing process with cost , where the conditional marginals of the process can vary arbitrarily, but that the number of different conditional marginals is bounded by over rounds. For such processes we prove a tight (upto factor) bound for the expected worst case regret of any finite VC-dimensional class under absolute loss (i.e., the expected miss-classification loss). We then improve this bound for general mixable losses, by establishing a tight (up to factor) regret bound . We extend these results to general smooth adversary processes with unknown reference measure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
