Variational Inference for Infinitely Deep Neural Networks
Achille Nazaret, David Blei

TL;DR
This paper introduces the unbounded depth neural network (UDN), a probabilistic model with infinite layers that adapts its complexity to data, and develops a variational inference algorithm to efficiently approximate its posterior distribution.
Contribution
The paper presents a novel variational inference method for infinitely deep neural networks, enabling adaptive depth modeling without predefined truncation.
Findings
UDN adapts its depth to data complexity
Outperforms standard neural networks of similar complexity
Outperforms existing approaches to infinite-depth neural networks
Abstract
We introduce the unbounded depth neural network (UDN), an infinitely deep probabilistic model that adapts its complexity to the training data. The UDN contains an infinite sequence of hidden layers and places an unbounded prior on a truncation L, the layer from which it produces its data. Given a dataset of observations, the posterior UDN provides a conditional distribution of both the parameters of the infinite neural network and its truncation. We develop a novel variational inference algorithm to approximate this posterior, optimizing a distribution of the neural network weights and of the truncation depth L, and without any upper limit on L. To this end, the variational family has a special structure: it models neural network weights of arbitrary depth, and it dynamically creates or removes free variational parameters as its distribution of the truncation is optimized. (Unlike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
MethodsVariational Inference
