The Power of Iterative Filtering for Supervised Learning with (Heavy) Contamination
Adam R. Klivans, Konstantinos Stavropoulos, Kevin Tian, Arsen Vasilyan

TL;DR
This paper introduces an iterative polynomial filtering algorithm that significantly advances supervised learning under contamination, enabling efficient learning of halfspaces and other functions despite heavy adversarial noise.
Contribution
It provides the first efficient algorithms for learning with contamination for various function classes, including halfspaces, under different distributional assumptions.
Findings
Efficient learning of halfspaces with bounded contamination up to error 2η+ε.
Near-optimal guarantees for classes with sandwiching approximators under heavy additive contamination.
First algorithms for tolerant testable learning of halfspaces on log-concave distributions.
Abstract
Inspired by recent work on learning with distribution shift, we give a general outlier removal algorithm called iterative polynomial filtering and show a number of striking applications for supervised learning with contamination: (1) We show that any function class that can be approximated by low-degree polynomials with respect to a hypercontractive distribution can be efficiently learned under bounded contamination (also known as nasty noise). This is a surprising resolution to a longstanding gap between the complexity of agnostic learning and learning with contamination, as it was widely believed that low-degree approximators only implied tolerance to label noise. In particular, it implies the first efficient algorithm for learning halfspaces with -bounded contamination up to error with respect to the Gaussian distribution. (2) For any function class that admits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTarget Tracking and Data Fusion in Sensor Networks · Speech and Audio Processing · Flow Measurement and Analysis
MethodsSparse Evolutionary Training
