Agnostic Smoothed Online Learning without Knowledge of the Base Measure
Mo\"ise Blanchard

TL;DR
This paper introduces an algorithm for smoothed online learning that operates effectively without prior knowledge of the base measure, bridging the gap between i.i.d. and adversarial models.
Contribution
It presents the first algorithm guaranteeing sublinear regret in agnostic smoothed online learning without knowing the base measure.
Findings
Achieves adaptive regret of O(T/) for classification.
Ensures sublinear oblivious regret for regression with polynomial fat-shattering dimension.
Extends smoothed online learning to more realistic scenarios without prior measure knowledge.
Abstract
Classical results in statistical learning typically consider two extreme data-generating models: i.i.d. instances from an unknown distribution, or fully adversarial instances, often much more challenging statistically. To bridge the gap between these models, recent work introduced the smoothed framework, in which at each iteration an adversary generates instances from a distribution constrained to have density bounded by compared to some fixed base measure . This framework interpolates between the i.i.d. and adversarial cases, depending on the value of . For the classical online prediction problem, most prior results in smoothed online learning rely on the arguably strong assumption that the base measure is known to the learner, contrasting with standard settings in the PAC learning or consistency literature. We consider the general agnostic problem in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
MethodsBalanced Selection
