A Theory of Universal Agnostic Learning
Steve Hanneke, Shay Moran

TL;DR
This paper develops a comprehensive theory for the optimal rates of binary classification in the agnostic setting, removing the realizability assumption and classifying concept classes into four fundamental rate categories.
Contribution
It extends previous realizable-case theories to the agnostic setting and introduces a tetrachotomy classification for the optimal convergence rates based on combinatorial structures.
Findings
Identifies four fundamental categories of convergence rates.
Provides a complete characterization of optimal rates for any concept class.
Links combinatorial structures to the classification of rate categories.
Abstract
We provide a complete theory of optimal universal rates for binary classification in the agnostic setting. This extends the realizable-case theory of Bousquet, Hanneke, Moran, van Handel, and Yehudayoff (2021) by removing the realizability assumption on the distribution. We identify a fundamental tetrachotomy of optimal rates: for every concept class, the optimal universal rate of convergence of the excess error rate is one of , , , or arbitrarily slow. We further identify simple combinatorial structures which determine which of these categories any given concept class falls into.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · Bayesian Methods and Mixture Models
