Maximal Information Divergence from Statistical Models defined by Neural   Networks

Guido Montufar; Johannes Rauh; Nihat Ay

arXiv:1303.0268·math.ST·June 18, 2014·GSI

Maximal Information Divergence from Statistical Models defined by Neural Networks

Guido Montufar, Johannes Rauh, Nihat Ay

PDF

TL;DR

This paper reviews recent findings on the maximum Kullback-Leibler divergence from neural network-based statistical models, providing methods to compute these divergences and presenting new results for specific deep belief networks.

Contribution

It offers a comprehensive review of maximal divergence values for various neural network models and introduces new results for deep and narrow belief networks.

Findings

01

Maximal divergence values are characterized for several neural network models.

02

Methods to compute divergence from simple sub- or super-models are illustrated.

03

New results are provided for deep and narrow belief networks with finite units.

Abstract

We review recent results about the maximal values of the Kullback-Leibler information divergence from statistical models defined by neural networks, including naive Bayes models, restricted Boltzmann machines, deep belief networks, and various classes of exponential families. We illustrate approaches to compute the maximal divergence from a given model starting from simple sub- or super-models. We give a new result for deep and narrow belief networks with finite-valued units.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.