Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks

Benjamin Leblanc; Mathieu Bazinet; Nathaniel D'Amours; Alexandre Drouin; Pascal Germain

arXiv:2410.13577·cs.LG·June 6, 2025

Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks

Benjamin Leblanc, Mathieu Bazinet, Nathaniel D'Amours, Alexandre Drouin, Pascal Germain

PDF

Open Access

TL;DR

This paper introduces a meta-learning framework using hypernetworks that encode datasets to produce neural network parameters, enabling tight generalization bounds via PAC-Bayesian and Sample Compression theories.

Contribution

It proposes novel hypernetwork architectures that encode datasets for meta-learning, providing new generalization guarantees through PAC-Bayesian and Sample Compression methods.

Findings

01

Developed three hypernetwork architectures encoding datasets before decoding parameters.

02

Derived new generalization bounds for neural networks using these architectures.

03

Established theoretical guarantees for downstream predictors in a meta-learning context.

Abstract

Both PAC-Bayesian and Sample Compress learning frameworks are instrumental for deriving tight (non-vacuous) generalization bounds for neural networks. We leverage these results in a meta-learning scheme, relying on a hypernetwork that outputs the parameters of a downstream predictor from a dataset input. The originality of our approach lies in the investigated hypernetwork architectures that encode the dataset before decoding the parameters: (1) a PAC-Bayesian encoder that expresses a posterior distribution over a latent space, (2) a Sample Compress encoder that selects a small sample of the dataset input along with a message from a discrete set, and (3) a hybrid between both approaches motivated by a new Sample Compress theorem handling continuous messages. The latter theorem exploits the pivotal information transiting at the encoder-decoder junction in order to compute generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications

MethodsHyperNetwork · Sparse Evolutionary Training