Assessing the Robustness of Bayesian Dark Knowledge to Posterior   Uncertainty

Meet P. Vadera; Benjamin M. Marlin

arXiv:1906.01724·cs.LG·June 11, 2019·1 cites

Assessing the Robustness of Bayesian Dark Knowledge to Posterior Uncertainty

Meet P. Vadera, Benjamin M. Marlin

PDF

Open Access

TL;DR

This paper investigates how Bayesian Dark Knowledge, a method for compressing neural network posteriors, performs under high posterior uncertainty and explores increasing model capacity to improve robustness.

Contribution

The study reveals limitations of Bayesian Dark Knowledge with high posterior uncertainty and proposes increasing student model capacity to enhance performance.

Findings

01

Performance degrades with higher posterior uncertainty.

02

Matching student architecture to teacher may fail under uncertainty.

03

Increasing student capacity improves robustness.

Abstract

Bayesian Dark Knowledge is a method for compressing the posterior predictive distribution of a neural network model into a more compact form. Specifically, the method attempts to compress a Monte Carlo approximation to the parameter posterior into a single network representing the posterior predictive distribution. Further, the authors show that this approach is successful in the classification setting using a student network whose architecture matches that of a single network in the teacher ensemble. In this work, we examine the robustness of Bayesian Dark Knowledge to higher levels of posterior uncertainty. We show that using a student network that matches the teacher architecture may fail to yield acceptable performance. We study an approach to close the resulting performance gap by increasing student model capacity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms