Fine-Grained Distribution-Dependent Learning Curves

Olivier Bousquet; Steve Hanneke; Shay Moran; Jonathan Shafer; Ilya; Tolstikhin

arXiv:2208.14615·cs.LG·November 14, 2022

Fine-Grained Distribution-Dependent Learning Curves

Olivier Bousquet, Steve Hanneke, Shay Moran, Jonathan Shafer, Ilya, Tolstikhin

PDF

Open Access

TL;DR

This paper introduces a new combinatorial dimension called VCL to characterize distribution-dependent learning curves, revealing a decomposition into linear and exponential components and strengthening existing lower bounds.

Contribution

It defines VCL, a novel combinatorial measure, to precisely characterize the best possible distribution-dependent lower bounds for learning curves, refining prior theories.

Findings

01

VCL characterizes the strongest minimax lower bounds for learning curves.

02

Learning rates decompose into linear and exponential components based on VCL.

03

The results recover known bounds for half-spaces in -dimensional space.

Abstract

Learning curves plot the expected error of a learning algorithm as a function of the number of labeled samples it receives from a target distribution. They are widely used as a measure of an algorithm's performance, but classic PAC learning theory cannot explain their behavior. As observed by Antos and Lugosi (1996 , 1998), the classic `No Free Lunch' lower bounds only trace the upper envelope above all learning curves of specific target distributions. For a concept class with VC dimension $d$ the classic bound decays like $d / n$ , yet it is possible that the learning curve for \emph{every} specific distribution decays exponentially. In this case, for each $n$ there exists a different `hard' distribution requiring $d / n$ samples. Antos and Lugosi asked which concept classes admit a `strong minimax lower bound' -- a lower bound of $d^{'} / n$ that holds for a fixed distribution for infinitely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques