Tighter risk certificates for neural networks
Mar\'ia P\'erez-Ortiz, Omar Rivasplata, John Shawe-Taylor and, Csaba Szepesv\'ari

TL;DR
This paper empirically investigates training probabilistic neural networks using novel PAC-Bayes derived objectives, achieving tighter risk bounds and promising self-certified learning capabilities on datasets like MNIST and CIFAR-10.
Contribution
Introduces two new training objectives based on tight PAC-Bayes bounds for probabilistic neural networks and compares them with classical bounds.
Findings
Achieves non-vacuous, tighter risk bounds than previous work.
Produces competitive test errors on MNIST and CIFAR-10.
Shows potential for self-certified learning without separate test sets.
Abstract
This paper presents an empirical study regarding training probabilistic neural networks using training objectives derived from PAC-Bayes bounds. In the context of probabilistic neural networks, the output of training is a probability distribution over network weights. We present two training objectives, used here for the first time in connection with training neural networks. These two training objectives are derived from tight PAC-Bayes bounds. We also re-implement a previously used training objective based on a classical PAC-Bayes bound, to compare the properties of the predictors learned using the different training objectives. We compute risk certificates for the learnt predictors, based on part of the data used to learn the predictors. We further experiment with different types of priors on the weights (both data-free and data-dependent priors) and neural network architectures. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Machine Learning and Algorithms
