Agnostic learning in (almost) optimal time via Gaussian surface area
Lucas Pesenti, Lucas Slot, Manuel Wiedmer

TL;DR
This paper improves the bounds on the complexity of agnostically learning concept classes with bounded Gaussian surface area, achieving near-optimal time complexity by refining polynomial approximation techniques.
Contribution
It refines the analysis of Gaussian surface area-based learning, reducing the polynomial degree needed for approximation from $O(rac{ ext{surface area}^2}{ ext{error}^4})$ to $ ilde O(rac{ ext{surface area}^2}{ ext{error}^2})$, leading to near-optimal bounds.
Findings
Achieved near-optimal bounds for agnostic learning of polynomial threshold functions.
Improved polynomial degree bounds for $L_1$-approximation in Gaussian space.
Extended analysis to match lower bounds in the statistical query model.
Abstract
The complexity of learning a concept class under Gaussian marginals in the difficult agnostic model is closely related to its -approximability by low-degree polynomials. For any concept class with Gaussian surface area at most , Klivans et al. (2008) show that degree suffices to achieve an -approximation. This leads to the best-known bounds on the complexity of learning a variety of concept classes. In this note, we improve their analysis by showing that degree is enough. In light of lower bounds due to Diakonikolas et al. (2021), this yields (near) optimal bounds on the complexity of agnostically learning polynomial threshold functions in the statistical query model. Our proof relies on a direct analogue of a construction of Feldman et al. (2020), who considered -approximation on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques
