Learning General Halfspaces with General Massart Noise under the Gaussian Distribution
Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos,, Nikos Zarifis

TL;DR
This paper develops new algorithms for learning halfspaces under Gaussian distribution with Massart noise, removing previous restrictions on the target halfspace's homogeneity and noise parameter, and establishes matching lower bounds for the problem.
Contribution
It introduces the first algorithms for general halfspaces with Massart noise without prior assumptions and proves matching lower bounds for their complexity.
Findings
Algorithms for general halfspaces with Massart noise for ta<1/2 and ta=1/2.
Sample and computational complexity bounds depending on ta and psilon.
Matching SQ lower bounds indicating inherent difficulty of the problem.
Abstract
We study the problem of PAC learning halfspaces on with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point with unknown probability , for some parameter . The goal is to find a hypothesis with misclassification error of , where is the error of the target halfspace. This problem had been previously studied under two assumptions: (i) the target halfspace is homogeneous (i.e., the separating hyperplane goes through the origin), and (ii) the parameter is strictly smaller than . Prior to this work, no nontrivial bounds were known when either of these assumptions is removed. We study the general problem and establish the following: For , we give a learning algorithm for general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques
MethodsFLIP
