Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks
Benjamin Leblanc, Mathieu Bazinet, Nathaniel D'Amours, Alexandre Drouin, Pascal Germain

TL;DR
This paper introduces a meta-learning framework using hypernetworks that encode datasets to produce neural network parameters, enabling tight generalization bounds via PAC-Bayesian and Sample Compression theories.
Contribution
It proposes novel hypernetwork architectures that encode datasets for meta-learning, providing new generalization guarantees through PAC-Bayesian and Sample Compression methods.
Findings
Developed three hypernetwork architectures encoding datasets before decoding parameters.
Derived new generalization bounds for neural networks using these architectures.
Established theoretical guarantees for downstream predictors in a meta-learning context.
Abstract
Both PAC-Bayesian and Sample Compress learning frameworks are instrumental for deriving tight (non-vacuous) generalization bounds for neural networks. We leverage these results in a meta-learning scheme, relying on a hypernetwork that outputs the parameters of a downstream predictor from a dataset input. The originality of our approach lies in the investigated hypernetwork architectures that encode the dataset before decoding the parameters: (1) a PAC-Bayesian encoder that expresses a posterior distribution over a latent space, (2) a Sample Compress encoder that selects a small sample of the dataset input along with a message from a discrete set, and (3) a hybrid between both approaches motivated by a new Sample Compress theorem handling continuous messages. The latter theorem exploits the pivotal information transiting at the encoder-decoder junction in order to compute generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsHyperNetwork · Sparse Evolutionary Training
