Statistical Learning of Arbitrary Computable Classifiers
David Soloveichik

TL;DR
This paper explores the challenge of learning all computable classifiers, demonstrating that while learning is possible, bounding sample complexity independently of the data distribution is impossible due to computability constraints.
Contribution
It introduces a learning algorithm for all computable classifiers and proves the impossibility of distribution-independent sample complexity bounds under computability constraints.
Findings
Learning over all computable classifiers is feasible.
Distribution-independent sample complexity bounds are impossible.
Computability constraints are the main obstacle to uniform bounds.
Abstract
Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to learn to a specified level of accuracy. Here we consider learning over the set of all computable labeling functions. Since the VC-dimension is infinite and a priori (uniform) bounds on the number of samples are impossible, we let the learning algorithm decide when it has seen sufficient samples to have learned. We first show that learning in this setting is indeed possible, and develop a learning algorithm. We then show, however, that bounding sample complexity independently of the distribution is impossible. Notably, this impossibility is entirely due to the requirement that the learning algorithm be computable, and not due to the statistical nature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Computability, Logic, AI Algorithms · Algorithms and Data Compression
