Higher-Order Generalization Bounds: Learning Deep Probabilistic Programs via PAC-Bayes Objectives
Jonathan Warrell, Mark Gerstein

TL;DR
This paper introduces a framework leveraging Deep Probabilistic Programming and PAC-Bayes bounds to derive and optimize higher-order generalization bounds, improving model performance and generalization in complex probabilistic models.
Contribution
It presents a novel approach to represent and learn higher-order PAC-Bayes bounds within DPP, enabling principled training objectives for complex probabilistic programs.
Findings
Improved generalization performance on synthetic and biological data.
Effective optimization of higher-order bounds using variational techniques.
Flexible DPP models with learned complexity measures enhance prediction accuracy.
Abstract
Deep Probabilistic Programming (DPP) allows powerful models based on recursive computation to be learned using efficient deep-learning optimization techniques. Additionally, DPP offers a unified perspective, where inference and learning algorithms are treated on a par with models as stochastic programs. Here, we offer a framework for representing and learning flexible PAC-Bayes bounds as stochastic programs using DPP-based methods. In particular, we show that DPP techniques may be leveraged to derive generalization bounds that draw on the compositionality of DPP representations. In turn, the bounds we introduce offer principled training objectives for higher-order probabilistic programs. We offer a definition of a higher-order generalization bound, which naturally encompasses single- and multi-task generalization perspectives (including transfer- and meta-learning) and a novel class of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
