User-friendly introduction to PAC-Bayes bounds
Pierre Alquier

TL;DR
This paper provides an accessible introduction to PAC-Bayes bounds, explaining their theoretical foundations, recent improvements, and applications, especially in neural network generalization.
Contribution
It offers a simplified, user-friendly explanation of PAC-Bayes bounds, including recent techniques like localization and mutual information bounds.
Findings
PAC-Bayes bounds have been significantly improved and simplified.
Recent applications include neural network generalization.
The paper clarifies complex concepts for broader understanding.
Abstract
Aggregated predictors are obtained by making a set of basic predictors vote according to some weights, that is, to some probability distribution. Randomized predictors are obtained by sampling in a set of basic predictors, according to some prescribed probability distribution. Thus, aggregated and randomized predictors have in common that they are not defined by a minimization problem, but by a probability distribution on the set of predictors. In statistical learning theory, there is a set of tools designed to understand the generalization ability of such procedures: PAC-Bayesian or PAC-Bayes bounds. Since the original PAC-Bayes bounds of D. McAllester, these tools have been considerably improved in many directions (we will for example describe a simplified version of the localization technique of O. Catoni that was missed by the community, and later rediscovered as "mutual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Fault Detection and Control Systems · Statistical Mechanics and Entropy
