Higher-Order Generalization Bounds: Learning Deep Probabilistic Programs   via PAC-Bayes Objectives

Jonathan Warrell; Mark Gerstein

arXiv:2203.15972·cs.LG·March 31, 2022

Higher-Order Generalization Bounds: Learning Deep Probabilistic Programs via PAC-Bayes Objectives

Jonathan Warrell, Mark Gerstein

PDF

Open Access

TL;DR

This paper introduces a framework leveraging Deep Probabilistic Programming and PAC-Bayes bounds to derive and optimize higher-order generalization bounds, improving model performance and generalization in complex probabilistic models.

Contribution

It presents a novel approach to represent and learn higher-order PAC-Bayes bounds within DPP, enabling principled training objectives for complex probabilistic programs.

Findings

01

Improved generalization performance on synthetic and biological data.

02

Effective optimization of higher-order bounds using variational techniques.

03

Flexible DPP models with learned complexity measures enhance prediction accuracy.

Abstract

Deep Probabilistic Programming (DPP) allows powerful models based on recursive computation to be learned using efficient deep-learning optimization techniques. Additionally, DPP offers a unified perspective, where inference and learning algorithms are treated on a par with models as stochastic programs. Here, we offer a framework for representing and learning flexible PAC-Bayes bounds as stochastic programs using DPP-based methods. In particular, we show that DPP techniques may be leveraged to derive generalization bounds that draw on the compositionality of DPP representations. In turn, the bounds we introduce offer principled training objectives for higher-order probabilistic programs. We offer a definition of a higher-order generalization bound, which naturally encompasses single- and multi-task generalization perspectives (including transfer- and meta-learning) and a novel class of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms