Wide Neural Networks with Bottlenecks are Deep Gaussian Processes
Devanshu Agrawal, Theodore Papamarkou, Jacob Hinkle

TL;DR
This paper extends the understanding of Bayesian neural networks by analyzing their wide limits when some hidden layers, called bottlenecks, are kept narrow, resulting in a composition of Gaussian processes with unique properties.
Contribution
It introduces the concept of bottleneck neural network Gaussian processes (bottleneck NNGPs) and proves their properties, extending the wide limit analysis to architectures with narrow layers.
Findings
Bottleneck NNGPs are compositions of Gaussian processes.
Bottlenecks induce persistent dependence between outputs.
Kernel discriminative power is preserved at deep bottlenecks.
Abstract
There has recently been much work on the "wide limit" of neural networks, where Bayesian neural networks (BNNs) are shown to converge to a Gaussian process (GP) as all hidden layers are sent to infinite width. However, these results do not apply to architectures that require one or more of the hidden layers to remain narrow. In this paper, we consider the wide limit of BNNs where some hidden layers, called "bottlenecks", are held at finite width. The result is a composition of GPs that we term a "bottleneck neural network Gaussian process" (bottleneck NNGP). Although intuitive, the subtlety of the proof is in showing that the wide limit of a composition of networks is in fact the composition of the limiting GPs. We also analyze theoretically a single-bottleneck NNGP, finding that the bottleneck induces dependence between the outputs of a multi-output network that persists through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
MethodsGaussian Process
