A Direct Sum Result for the Information Complexity of Learning

Ido Nachum; Jonathan Shafer; Amir Yehudayoff

arXiv:1804.05474·cs.LG·April 20, 2018·1 cites

A Direct Sum Result for the Information Complexity of Learning

Ido Nachum, Jonathan Shafer, Amir Yehudayoff

PDF

Open Access

TL;DR

This paper establishes a lower bound on the mutual information needed for PAC learning classes with VC dimension d, showing that the information complexity scales with d log log(|X|/d), and proves a direct sum property for this complexity.

Contribution

It introduces a lower bound on the information complexity for classes with VC dimension d and proves a direct sum theorem for the information complexity of combined classes.

Findings

01

Lower bound of Ω(d log log(|X|/d)) bits for information complexity

02

Information complexity sums when combining multiple classes

03

Generalization of previous results for VC dimension d

Abstract

How many bits of information are required to PAC learn a class of hypotheses of VC dimension $d$ ? The mathematical setting we follow is that of Bassily et al. (2018), where the value of interest is the mutual information $I (S; A (S))$ between the input sample $S$ and the hypothesis outputted by the learning algorithm $A$ . We introduce a class of functions of VC dimension $d$ over the domain $X$ with information complexity at least $Ω (d lo g lo g \frac{∣ X ∣}{d})$ bits for any consistent and proper algorithm (deterministic or random). Bassily et al. proved a similar (but quantitatively weaker) result for the case $d = 1$ . The above result is in fact a special case of a more general phenomenon we explore. We define the notion of information complexity of a given class of functions $H$ . Intuitively, it is the minimum amount of information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Computability, Logic, AI Algorithms