Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise
Rajai Nasser, Stefan Tiegel

TL;DR
This paper establishes tight statistical query lower bounds for learning halfspaces with Massart noise, demonstrating the computational difficulty of achieving low error under certain noise conditions, matching existing algorithms.
Contribution
It provides the first tight SQ lower bounds for learning halfspaces with Massart noise, even when the optimal error is exponentially small, confirming the computational limits of such learning tasks.
Findings
SQ algorithms require superpolynomial accuracy or queries for error below η.
Lower bounds hold even when the optimal error is exponentially small.
Achieving error less than 1/2 in the Tsybakov model is SQ-hard.
Abstract
We give tight statistical query (SQ) lower bounds for learnining halfspaces in the presence of Massart noise. In particular, suppose that all labels are corrupted with probability at most . We show that for arbitrary every SQ algorithm achieving misclassification error better than requires queries of superpolynomial accuracy or at least a superpolynomial number of queries. Further, this continues to hold even if the information-theoretically optimal error is as small as , where is the dimension and is an arbitrary absolute constant, and an overwhelming fraction of examples are noiseless. Our lower bound matches known polynomial time algorithms, which are also implementable in the SQ framework. Previously, such lower bounds only ruled out algorithms achieving error or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression
