Online Learning of Halfspaces with Massart Noise
Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

TL;DR
This paper introduces an efficient online learning algorithm for halfspaces under Massart noise, achieving near-optimal mistake bounds and extending to a bandit setting with linear ranking rewards.
Contribution
It presents the first computationally efficient online algorithm for Massart noise in halfspaces with tight mistake bounds and extends the framework to a linear ranking bandit setting.
Findings
Achieves mistake bound of ηT + o(T) for Massart noise.
Extends to a bandit setting with linear ranking rewards.
Provides an efficient algorithm outperforming random actions in reward.
Abstract
We study the task of online learning in the presence of Massart noise. Instead of assuming that the online adversary chooses an arbitrary sequence of labels, we assume that the context is selected adversarially but the label presented to the learner disagrees with the ground-truth label of with unknown probability at most . We study the fundamental class of -margin linear classifiers and present a computationally efficient algorithm that achieves mistake bound . Our mistake bound is qualitatively tight for efficient algorithms: it is known that even in the offline setting achieving classification error better than requires super-polynomial time in the SQ model. We extend our online learning model to a -arm contextual bandit setting where the rewards -- instead of satisfying commonly used realizability assumptions --…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetaheuristic Optimization Algorithms Research · Machine Learning and Algorithms
