SQ Lower Bounds for Learning Single Neurons with Massart Noise
Ilias Diakonikolas, Daniel M. Kane, Lisheng Ren, Yuxin Sun

TL;DR
This paper proves super-polynomial SQ lower bounds for efficiently learning single neurons with Massart noise, including ReLUs, showing fundamental computational limitations in this setting.
Contribution
It introduces a novel SQ-hard construction for learning Massart halfspaces, establishing fundamental lower bounds for neural network learning with noise.
Findings
No efficient SQ algorithm can approximate the optimal error within any constant factor.
Super-polynomial SQ lower bounds are established for learning single neurons with Massart noise.
The construction is interesting for learning Massart halfspaces on the Boolean hypercube.
Abstract
We study the problem of PAC learning a single neuron in the presence of Massart noise. Specifically, for a known activation function , the learner is given access to labeled examples , where the marginal distribution of is arbitrary and the corresponding label is a Massart corruption of . The goal of the learner is to output a hypothesis with small squared loss. For a range of activation functions, including ReLUs, we establish super-polynomial Statistical Query (SQ) lower bounds for this learning problem. In more detail, we prove that no efficient SQ algorithm can approximate the optimal error within any constant factor. Our main technical contribution is a novel SQ-hard construction for learning $\{ \pm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Neural Networks and Applications
