A Fully Polynomial-Time Algorithm for Robustly Learning Halfspaces over the Hypercube
Gautam Chandrasekaran, Adam R. Klivans, Konstantinos Stavropoulos, Arsen Vasilyan

TL;DR
This paper introduces the first fully polynomial-time algorithm for robustly learning halfspaces over the hypercube under adversarial contamination, achieving near-optimal error guarantees without relying on continuous distribution properties.
Contribution
It presents a novel polynomial-time algorithm for robust halfspace learning on the hypercube, overcoming previous limitations related to distribution assumptions and dependence on the Lipschitz constant.
Findings
Achieves error of η^{O(1)} + ε under adversarial noise
Operates efficiently with polylogarithmic dependence on activation Lipschitz constant
Extends robust learning techniques to discrete distributions
Abstract
We give the first fully polynomial-time algorithm for learning halfspaces with respect to the uniform distribution on the hypercube in the presence of contamination, where an adversary may corrupt some fraction of examples and labels arbitrarily. We achieve an error guarantee of where is the noise rate. Such a result was not known even in the agnostic setting, where only labels can be adversarially corrupted. All prior work over the last two decades has a superpolynomial dependence in or succeeds only with respect to continuous marginals (such as log-concave densities). Previous analyses rely heavily on various structural properties of continuous distributions such as anti-concentration. Our approach avoids these requirements and makes use of a new algorithm for learning Generalized Linear Models (GLMs) with only a polylogarithmic dependence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Stochastic Gradient Optimization Techniques
