Rate-Distortion Bounds on Bayes Risk in Supervised Learning
Matthew Nokleby, Ahmad Beirami, and Robert Calderbank

TL;DR
This paper introduces an information-theoretic framework to bound the number of samples needed for supervised learning, linking rate-distortion theory with Bayesian classifier risk to derive tight bounds on learning performance.
Contribution
It develops a novel rate-distortion approach to analyze Bayesian supervised learning, providing tight bounds on classifier risk based on information measures and model complexity.
Findings
Bounds are tight orderwise for various models.
Framework complements PAC bounds by focusing on average risk.
Uses Fisher information to relate data quantity to classifier accuracy.
Abstract
We present an information-theoretic framework for bounding the number of labeled samples needed to train a classifier in a parametric Bayesian setting. We derive bounds on the average distance between the learned classifier and the true maximum a posteriori classifier, which are well-established surrogates for the excess classification error due to imperfect learning. We provide lower and upper bounds on the rate-distortion function, using loss as the distortion measure, of a maximum a priori classifier in terms of the differential entropy of the posterior distribution and a quantity called the interpolation dimension, which characterizes the complexity of the parametric distribution family. In addition to expressing the information content of a classifier in terms of lossy compression, the rate-distortion function also expresses the minimum number of bits a learning machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification
