Fine-Grained Distribution-Dependent Learning Curves
Olivier Bousquet, Steve Hanneke, Shay Moran, Jonathan Shafer, Ilya, Tolstikhin

TL;DR
This paper introduces a new combinatorial dimension called VCL to characterize distribution-dependent learning curves, revealing a decomposition into linear and exponential components and strengthening existing lower bounds.
Contribution
It defines VCL, a novel combinatorial measure, to precisely characterize the best possible distribution-dependent lower bounds for learning curves, refining prior theories.
Findings
VCL characterizes the strongest minimax lower bounds for learning curves.
Learning rates decompose into linear and exponential components based on VCL.
The results recover known bounds for half-spaces in -dimensional space.
Abstract
Learning curves plot the expected error of a learning algorithm as a function of the number of labeled samples it receives from a target distribution. They are widely used as a measure of an algorithm's performance, but classic PAC learning theory cannot explain their behavior. As observed by Antos and Lugosi (1996 , 1998), the classic `No Free Lunch' lower bounds only trace the upper envelope above all learning curves of specific target distributions. For a concept class with VC dimension the classic bound decays like , yet it is possible that the learning curve for \emph{every} specific distribution decays exponentially. In this case, for each there exists a different `hard' distribution requiring samples. Antos and Lugosi asked which concept classes admit a `strong minimax lower bound' -- a lower bound of that holds for a fixed distribution for infinitely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
