Near-Optimal Bounds for Learning Gaussian Halfspaces with Random   Classification Noise

Ilias Diakonikolas; Jelena Diakonikolas; Daniel M. Kane; Puqian Wang,; Nikos Zarifis

arXiv:2307.08438·cs.LG·July 18, 2023

Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise

Ilias Diakonikolas, Jelena Diakonikolas, Daniel M. Kane, Puqian Wang,, Nikos Zarifis

PDF

Open Access 1 Video

TL;DR

This paper investigates the problem of learning Gaussian halfspaces with random classification noise, establishing nearly-matching upper and lower bounds that reveal an inherent information-computation gap and the complexity of the task.

Contribution

It provides the first nearly tight bounds for learning Gaussian halfspaces with noise, highlighting an information-computation gap and the limitations of efficient algorithms.

Findings

01

Sample complexity is $ ilde{ heta}(d/ extepsilon)$.

02

Efficient algorithms require $ ilde{O}(d/ extepsilon + d/( extmaxigrace{p, extepsilonigrace})^2)$ samples.

03

Any SQ algorithm needs at least $ ilde{ heta}(d^{1/2}/( extmaxigrace{p, extepsilonigrace})^2)$ samples.

Abstract

We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces with Random Classification Noise under the Gaussian distribution. We establish nearly-matching algorithmic and Statistical Query (SQ) lower bound results revealing a surprising information-computation gap for this basic problem. Specifically, the sample complexity of this learning problem is $Θ (d / ϵ)$ , where $d$ is the dimension and $ϵ$ is the excess error. Our positive result is a computationally efficient learning algorithm with sample complexity $\tilde{O} (d / ϵ + d / (max {p, ϵ})^{2})$ , where $p$ quantifies the bias of the target halfspace. On the lower bound side, we show that any efficient SQ algorithm (or low-degree test) for the problem requires sample complexity at least $Ω (d^{1/2} / (max {p, ϵ})^{2})$ . Our lower bound suggests that this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification