A Direct Sum Result for the Information Complexity of Learning
Ido Nachum, Jonathan Shafer, Amir Yehudayoff

TL;DR
This paper establishes a lower bound on the mutual information needed for PAC learning classes with VC dimension d, showing that the information complexity scales with d log log(|X|/d), and proves a direct sum property for this complexity.
Contribution
It introduces a lower bound on the information complexity for classes with VC dimension d and proves a direct sum theorem for the information complexity of combined classes.
Findings
Lower bound of Ω(d log log(|X|/d)) bits for information complexity
Information complexity sums when combining multiple classes
Generalization of previous results for VC dimension d
Abstract
How many bits of information are required to PAC learn a class of hypotheses of VC dimension ? The mathematical setting we follow is that of Bassily et al. (2018), where the value of interest is the mutual information between the input sample and the hypothesis outputted by the learning algorithm . We introduce a class of functions of VC dimension over the domain with information complexity at least bits for any consistent and proper algorithm (deterministic or random). Bassily et al. proved a similar (but quantitatively weaker) result for the case . The above result is in fact a special case of a more general phenomenon we explore. We define the notion of information complexity of a given class of functions . Intuitively, it is the minimum amount of information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Computability, Logic, AI Algorithms
