Sample compression schemes for VC classes

Shay Moran; Amir Yehudayoff

arXiv:1503.06960·cs.LG·April 15, 2015

Sample compression schemes for VC classes

Shay Moran, Amir Yehudayoff

PDF

Open Access

TL;DR

This paper proves that any concept class with VC dimension d has a sample compression scheme of size exponential in d, linking compression schemes to VC theory and PAC learnability.

Contribution

It establishes that all VC classes admit exponential-size sample compression schemes, answering a longstanding question in learning theory.

Findings

01

Every VC class has a sample compression scheme of size exponential in VC dimension.

02

The proof involves an approximate minimax phenomenon for low VC dimension binary matrices.

03

Results connect sample compression, VC theory, and game theory concepts.

Abstract

Sample compression schemes were defined by Littlestone and Warmuth (1986) as an abstraction of the structure underlying many learning algorithms. Roughly speaking, a sample compression scheme of size $k$ means that given an arbitrary list of labeled examples, one can retain only $k$ of them in a way that allows to recover the labels of all other examples in the list. They showed that compression implies PAC learnability for binary-labeled classes, and asked whether the other direction holds. We answer their question and show that every concept class $C$ with VC dimension $d$ has a sample compression scheme of size exponential in $d$ . The proof uses an approximate minimax phenomenon for binary matrices of low VC dimension, which may be of interest in the context of game theory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · Complexity and Algorithms in Graphs