PAC-Bayes, MAC-Bayes and Conditional Mutual Information: Fast rate bounds that handle general VC classes
Peter Gr\"unwald, Thomas Steinke, Lydia Zakynthinou

TL;DR
This paper introduces a unified approach to derive fast rate generalization bounds for VC classes using PAC-Bayesian and mutual information methods, enabling improved rates under certain conditions.
Contribution
It provides a novel unified derivation of PAC-Bayesian and MI bounds that achieve faster convergence rates for general VC classes, extending previous work.
Findings
Enables nontrivial PAC-Bayes and MI bounds for VC classes.
Achieves faster rates of order (KL/n)^γ under Bernstein condition.
Extends prior work by incorporating fast rates with VC classes.
Abstract
We give a novel, unified derivation of conditional PAC-Bayesian and mutual information (MI) generalization bounds. We derive conditional MI bounds as an instance, with special choice of prior, of conditional MAC-Bayesian (Mean Approximately Correct) bounds, itself derived from conditional PAC-Bayesian bounds, where `conditional' means that one can use priors conditioned on a joint training and ghost sample. This allows us to get nontrivial PAC-Bayes and MI-style bounds for general VC classes, something recently shown to be impossible with standard PAC-Bayesian/MI bounds. Second, it allows us to get faster rates of order for if a Bernstein condition holds and for exp-concave losses (with ), which is impossible with both standard PAC-Bayes generalization and MI bounds. Our work extends the recent work by Steinke and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Anomaly Detection Techniques and Applications
