Unifying Variational Inference and PAC-Bayes for Supervised Learning   that Scales

Sanjay Thakur; Herke Van Hoof; Gunshi Gupta; David Meger

arXiv:1910.10367·stat.ML·December 18, 2019·1 cites

Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Sanjay Thakur, Herke Van Hoof, Gunshi Gupta, David Meger

PDF

Open Access 1 Repo

TL;DR

This paper introduces a scalable method combining variational inference and PAC-Bayes frameworks to improve generalization in high-dimensional neural network controllers, validated on MuJoCo tasks.

Contribution

It unifies variational inference and PAC-Bayes for scalable, high-dimensional supervised learning without explicit environmental assumptions.

Findings

01

Better generalization in high-dimensional tasks

02

Theoretical validation of the combined approach

03

Effective on MuJoCo locomotion benchmarks

Abstract

Neural Network based controllers hold enormous potential to learn complex, high-dimensional functions. However, they are prone to overfitting and unwarranted extrapolations. PAC Bayes is a generalized framework which is more resistant to overfitting and that yields performance bounds that hold with arbitrarily high probability even on the unjustified extrapolations. However, optimizing to learn such a function and a bound is intractable for complex tasks. In this work, we propose a method to simultaneously learn such a function and estimate performance bounds that scale organically to high-dimensions, non-linear environments without making any explicit assumptions about the environment. We build our approach on a parallel that we draw between the formulations called ELBO and PAC Bayes when the risk metric is negative log likelihood. Through our experiments on multiple high dimensional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sanjaythakur/Unifying-VI-and-PAC-Bayes-for-Learning-that-Scales
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Gaussian Processes and Bayesian Inference