Universal Approximation of Markov Kernels by Shallow Stochastic Feedforward Networks
Guido Montufar

TL;DR
This paper proves that shallow stochastic neural networks with sigmoid activations can universally approximate any Markov kernel, providing explicit bounds on the number of hidden units needed for such approximation.
Contribution
It establishes the minimal hidden units required for universal approximation of Markov kernels by shallow stochastic networks with sigmoid activation.
Findings
Explicit upper bounds on hidden units for universal approximation.
Approximation of any probabilistic assignment of output states given input states.
Provides theoretical foundation for the expressive power of shallow stochastic networks.
Abstract
We establish upper bounds for the minimal number of hidden units for which a binary stochastic feedforward network with sigmoid activation probabilities and a single hidden layer is a universal approximator of Markov kernels. We show that each possible probabilistic assignment of the states of output units, given the states of input units, can be approximated arbitrarily well by a network with hidden units.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques
MethodsDense Connections · Feedforward Network · Sigmoid Activation
