Uniform Generalization Bounds on Data-Dependent Hypothesis Sets via PAC-Bayesian Theory on Random Sets

Benjamin Dupuis; Paul Viallard; George Deligiannidis; Umut Simsekli

arXiv:2404.17442 · stat.ML · February 11, 2025

TL;DR

This paper derives data-dependent uniform generalization bounds by applying PAC-Bayesian theory to random sets. The approach yields tighter fractal-dimension-based bounds and uniform bounds over the trajectories of noisy algorithms such as Langevin dynamics.

Contribution

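The paper develops a rigorous PAC-Bayesian framework on random sets, in which the training algorithm outputs a data-dependent hypothesis set. It uses this framework to unify existing fractal-dimension-based bounds under one proof technique and to prove uniform bounds over the trajectories of continuous Langevin dynamics and stochastic gradient Langevin dynamics.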

Findings

01. Tighter fractal-dimension-based generalization bounds.

02. Uniform bounds over Langevin dynamics trajectories.

03. Insights into noisy algorithm generalization.

Abstract

We propose data-dependent uniform generalization bounds by approaching the problem from a PAC-Bayesian perspective. We first apply the PAC-Bayesian framework on "random sets" in a rigorous way, where the training algorithm is assumed to output a data-dependent hypothesis set after observing the training data. This approach allows us to prove data-dependent bounds, which can be applicable in numerous contexts. To highlight the power of our approach, we consider two main applications. First, we propose a PAC-Bayesian formulation of the recently developed fractal-dimension-based generalization bounds. The derived results are shown to be tighter and they unify the existing results around one simple proof technique. Second, we prove uniform bounds over the trajectories of continuous Langevin dynamics and stochastic gradient Langevin dynamics. These results provide novel information about the…
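To make the phrase "uniform bounds over the trajectories" concrete, here is a minimal sketch, not the paper's method or bound. It runs stochastic gradient Langevin dynamics on a toy one-dimensional squared loss, treats the set of iterates as the data-dependent hypothesis set, and measures the worst generalization gap over that set, which is the quantity a uniform bound would control. The loss, the hyperparameters, and the names `sgld_trajectory` and `worst_case_gap` are all assumptions made for illustration.

```python
import math
import random


def sgld_trajectory(data, theta0=0.0, step=0.01, inv_temp=100.0,
                    batch_size=8, n_steps=200, seed=0):
    """Run SGLD on the toy loss 0.5 * (theta - x)^2 and return every iterate.

    The loss and all default hyperparameters are illustrative assumptions.
    """
    rng = random.Random(seed)
    theta = theta0
    trajectory = [theta]
    # Standard SGLD noise scale: sqrt(2 * step / inverse temperature).
    noise_scale = math.sqrt(2.0 * step / inv_temp)
    for _ in range(n_steps):
        batch = rng.sample(data, batch_size)
        # Minibatch gradient of the mean squared loss with respect to theta.
        grad = sum(theta - x for x in batch) / batch_size
        # Gradient step plus injected Gaussian noise.
        theta = theta - step * grad + noise_scale * rng.gauss(0.0, 1.0)
        trajectory.append(theta)
    return trajectory


def worst_case_gap(trajectory, train, test):
    """Largest |test risk - train risk| over all iterates in the trajectory."""
    def risk(theta, sample):
        return sum(0.5 * (theta - x) ** 2 for x in sample) / len(sample)

    return max(abs(risk(t, test) - risk(t, train)) for t in trajectory)


if __name__ == "__main__":
    rng = random.Random(1)
    train = [rng.gauss(1.0, 1.0) for _ in range(64)]
    test = [rng.gauss(1.0, 1.0) for _ in range(1000)]
    traj = sgld_trajectory(train)
    print("iterates in the random hypothesis set:", len(traj))
    print("worst-case gap over the trajectory:", worst_case_gap(traj, train, test))
```

The held-out sample here only stands in for the population risk; a bound of the kind described in the abstract would control the supremum without access to such a sample.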
