User-friendly introduction to PAC-Bayes bounds

Pierre Alquier

arXiv:2110.11216·stat.ML·March 3, 2025·24 cites

User-friendly introduction to PAC-Bayes bounds

Pierre Alquier

PDF

Open Access

TL;DR

This paper provides an accessible introduction to PAC-Bayes bounds, explaining their theoretical foundations, recent improvements, and applications, especially in neural network generalization.

Contribution

It offers a simplified, user-friendly explanation of PAC-Bayes bounds, including recent techniques like localization and mutual information bounds.

Findings

01

PAC-Bayes bounds have been significantly improved and simplified.

02

Recent applications include neural network generalization.

03

The paper clarifies complex concepts for broader understanding.

Abstract

Aggregated predictors are obtained by making a set of basic predictors vote according to some weights, that is, to some probability distribution. Randomized predictors are obtained by sampling in a set of basic predictors, according to some prescribed probability distribution. Thus, aggregated and randomized predictors have in common that they are not defined by a minimization problem, but by a probability distribution on the set of predictors. In statistical learning theory, there is a set of tools designed to understand the generalization ability of such procedures: PAC-Bayesian or PAC-Bayes bounds. Since the original PAC-Bayes bounds of D. McAllester, these tools have been considerably improved in many directions (we will for example describe a simplified version of the localization technique of O. Catoni that was missed by the community, and later rediscovered as "mutual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fault Detection and Control Systems · Statistical Mechanics and Entropy