Refinements of Universal Approximation Results for Deep Belief Networks and Restricted Boltzmann Machines

Guido Montufar; Nihat Ay

arXiv:1005.1593 · stat.ML · July 27, 2010 · Neural Comput.

Open Access

TL;DR

This paper tightens the bounds on how many hidden units an RBM and how many hidden layers a DBN need in order to be universal approximators of distributions on binary vectors, and confirms a conjecture of Le Roux and Bengio (2010).

Contribution

It improves existing bounds on the number of hidden units and layers required for RBMs and DBNs to approximate any distribution on binary vectors.

Findings

01

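An RBM with k-1 hidden units can approximate any distribution p on binary vectors of length n arbitrarily well, where k is the minimal number of pairs of binary vectors differing in only one entry whose union contains the support of p.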

02

Constructed a DBN with 2^n / (2(n - b)), b ~ log(n), hidden layers of width n that can approximate any distribution on {0,1}^n arbitrarily well.

03

Confirmed a conjecture by Le Roux and Bengio (2010) regarding DBN approximation capabilities.
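
The quantity k in finding 01 can be computed for small examples. The sketch below is an illustration, not code from the paper: it treats binary vectors as vertices of the n-dimensional hypercube, so a pair of vectors differing in one entry is an edge, and k is the size of a smallest set of edges covering the support. It assumes that this minimum equals the support size minus a maximum matching among adjacent support points, since every matched pair shares one edge and every remaining point needs an edge of its own. The function name `min_edge_pairs` is hypothetical.

```python
from itertools import product


def min_edge_pairs(support):
    """Smallest number of hypercube edges whose union contains `support`.

    `support` is an iterable of equal-length 0/1 tuples (length >= 1).
    Assumption: k = |support| - (maximum matching between adjacent
    support points), computed with augmenting paths.
    """
    points = {tuple(v) for v in support}

    # The hypercube is bipartite: neighbours always differ in parity.
    even = [v for v in points if sum(v) % 2 == 0]

    def neighbours(v):
        # Support points that differ from v in exactly one entry.
        for i in range(len(v)):
            w = v[:i] + (1 - v[i],) + v[i + 1:]
            if w in points:
                yield w

    match = {}  # odd-parity point -> even-parity point matched to it

    def augment(u, seen):
        # Try to match the even-parity point u, re-routing earlier matches.
        for w in neighbours(u):
            if w in seen:
                continue
            seen.add(w)
            if w not in match or augment(match[w], seen):
                match[w] = u
                return True
        return False

    matching = sum(augment(u, set()) for u in even)
    return len(points) - matching


# Full support on {0,1}^3: 8 points, perfectly matched along edges.
k = min_edge_pairs(product([0, 1], repeat=3))
print(k, k - 1)  # 4 pairs, so 3 hidden units by finding 01
```

For a support with no two adjacent points, such as {(0,0), (1,1)}, no pair can cover two support points at once, so k equals the support size.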

Abstract

We improve recently published results about resources of Restricted Boltzmann Machines (RBM) and Deep Belief Networks (DBN) required to make them universal approximators. We show that any distribution p on the set of binary vectors of length n can be arbitrarily well approximated by an RBM with k-1 hidden units, where k is the minimal number of pairs of binary vectors differing in only one entry such that their union contains the support set of p. In important cases this number is half of the cardinality of the support set of p. We construct a DBN with 2^n / (2(n - b)), b ~ log(n), hidden layers of width n that is capable of approximating any distribution on {0,1}^n arbitrarily well. This confirms a conjecture presented by Le Roux and Bengio (2010).
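
To get a feel for the DBN depth quoted in the abstract, the following sketch evaluates the expression for a few input sizes. It reads the expression as 2^n divided by 2(n - b), and it picks b as log base 2 of n rounded to the nearest integer; the base and the rounding are assumptions made here, since the abstract only states b ~ log(n). The function name `dbn_hidden_layers` is hypothetical.

```python
import math


def dbn_hidden_layers(n, b):
    """Evaluate 2^n / (2 (n - b)), the hidden-layer count from the abstract."""
    if not 0 <= b < n:
        raise ValueError("need 0 <= b < n")
    return 2 ** n / (2 * (n - b))


for n in (4, 7, 12):
    # Assumption: b is log2(n) rounded to the nearest integer.
    b = round(math.log2(n))
    print(n, b, dbn_hidden_layers(n, b))
```

With these choices, n = 4 gives b = 2 and 4.0 layers, and n = 7 gives b = 3 and 16.0 layers, each layer having width n.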

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning