Online Learning of Halfspaces with Massart Noise

Ilias Diakonikolas; Vasilis Kontonis; Christos Tzamos; Nikos Zarifis

arXiv:2405.12958·cs.LG·May 22, 2024

Online Learning of Halfspaces with Massart Noise

Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

PDF

Open Access

TL;DR

This paper introduces an efficient online learning algorithm for halfspaces under Massart noise, achieving near-optimal mistake bounds and extending to a bandit setting with linear ranking rewards.

Contribution

It presents the first computationally efficient online algorithm for Massart noise in halfspaces with tight mistake bounds and extends the framework to a linear ranking bandit setting.

Findings

01

Achieves mistake bound of ηT + o(T) for Massart noise.

02

Extends to a bandit setting with linear ranking rewards.

03

Provides an efficient algorithm outperforming random actions in reward.

Abstract

We study the task of online learning in the presence of Massart noise. Instead of assuming that the online adversary chooses an arbitrary sequence of labels, we assume that the context $x$ is selected adversarially but the label $y$ presented to the learner disagrees with the ground-truth label of $x$ with unknown probability at most $η$ . We study the fundamental class of $γ$ -margin linear classifiers and present a computationally efficient algorithm that achieves mistake bound $η T + o (T)$ . Our mistake bound is qualitatively tight for efficient algorithms: it is known that even in the offline setting achieving classification error better than $η$ requires super-polynomial time in the SQ model. We extend our online learning model to a $k$ -arm contextual bandit setting where the rewards -- instead of satisfying commonly used realizability assumptions --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMetaheuristic Optimization Algorithms Research · Machine Learning and Algorithms