Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise

Rajai Nasser; Stefan Tiegel

arXiv:2201.09818·cs.LG·January 25, 2022

Optimal SQ Lower Bounds for Learning Halfspaces with Massart Noise

Rajai Nasser, Stefan Tiegel

PDF

Open Access

TL;DR

This paper establishes tight statistical query lower bounds for learning halfspaces with Massart noise, demonstrating the computational difficulty of achieving low error under certain noise conditions, matching existing algorithms.

Contribution

It provides the first tight SQ lower bounds for learning halfspaces with Massart noise, even when the optimal error is exponentially small, confirming the computational limits of such learning tasks.

Findings

01

SQ algorithms require superpolynomial accuracy or queries for error below η.

02

Lower bounds hold even when the optimal error is exponentially small.

03

Achieving error less than 1/2 in the Tsybakov model is SQ-hard.

Abstract

We give tight statistical query (SQ) lower bounds for learnining halfspaces in the presence of Massart noise. In particular, suppose that all labels are corrupted with probability at most $η$ . We show that for arbitrary $η \in [0, 1/2]$ every SQ algorithm achieving misclassification error better than $η$ requires queries of superpolynomial accuracy or at least a superpolynomial number of queries. Further, this continues to hold even if the information-theoretically optimal error $OPT$ is as small as $exp (- lo g^{c} (d))$ , where $d$ is the dimension and $0 < c < 1$ is an arbitrary absolute constant, and an overwhelming fraction of examples are noiseless. Our lower bound matches known polynomial time algorithms, which are also implementable in the SQ framework. Previously, such lower bounds only ruled out algorithms achieving error $OPT + ϵ$ or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression