Sample-Optimal PAC Learning of Halfspaces with Malicious Noise
Jie Shen

TL;DR
This paper improves the sample complexity bounds for PAC learning of halfspaces under malicious noise, achieving near-optimal bounds with a novel analysis and extending results to stronger noise models.
Contribution
It presents a new analysis of an existing algorithm that achieves near-optimal sample complexity for learning halfspaces with malicious noise, and extends the approach to more challenging noise models.
Findings
Achieves near-optimal sample complexity of d in isotropic log-concave distributions.
Introduces a matrix Chernoff-type inequality for covariance matrix analysis.
Extends the algorithm to handle stronger nasty noise models.
Abstract
We study efficient PAC learning of homogeneous halfspaces in in the presence of malicious noise of Valiant (1985). This is a challenging noise model and only until recently has near-optimal noise tolerance bound been established under the mild condition that the unlabeled data distribution is isotropic log-concave. However, it remains unsettled how to obtain the optimal sample complexity simultaneously. In this work, we present a new analysis for the algorithm of Awasthi et al. (2017) and show that it essentially achieves the near-optimal sample complexity bound of , improving the best known result of . Our main ingredient is a novel incorporation of a matrix Chernoff-type inequality to bound the spectrum of an empirical covariance matrix for well-behaved distributions, in conjunction with a careful exploration of the localization schemes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
