Statistical Query Lower Bounds for Smoothed Agnostic Learning
Ilias Diakonikolas, Daniel M. Kane

TL;DR
This paper establishes near-tight statistical query lower bounds for smoothed agnostic learning of halfspaces, showing that current algorithms are close to optimal in complexity under Gaussian perturbations.
Contribution
It provides the first non-trivial SQ lower bounds for smoothed agnostic learning of halfspaces, nearly matching existing upper bounds and demonstrating the optimality of $L_1$-polynomial regression.
Findings
SQ lower bounds match upper bounds up to constants
Applying $L_1$-polynomial regression is essentially optimal
Lower bounds are proved via moment-matching and polynomial approximation techniques
Abstract
We study the complexity of smoothed agnostic learning, recently introduced by~\cite{CKKMS24}, in which the learner competes with the best classifier in a target class under slight Gaussian perturbations of the inputs. Specifically, we focus on the prototypical task of agnostically learning halfspaces under subgaussian distributions in the smoothed model. The best known upper bound for this problem relies on -polynomial regression and has complexity , where is the smoothing parameter and is the excess error. Our main result is a Statistical Query (SQ) lower bound providing formal evidence that this upper bound is close to best possible. In more detail, we show that (even for Gaussian marginals) any SQ algorithm for smoothed agnostic learning of halfspaces requires complexity .…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques
