A Probabilistic Representation of Deep Learning

Xinjie Lan; Kenneth E. Barner

arXiv:1908.09772·cs.LG·August 27, 2019·1 cites

A Probabilistic Representation of Deep Learning

Xinjie Lan, Kenneth E. Barner

PDF

Open Access

TL;DR

This paper introduces a probabilistic framework for deep learning, interpreting neural networks as Gibbs distributions and Bayesian models, providing new insights into hierarchy and generalization.

Contribution

It presents a novel probabilistic representation of DNNs, linking neurons to Gibbs energy, and offers a Bayesian perspective on hierarchy and regularization in deep learning.

Findings

01

Neurons define the energy of a Gibbs distribution.

02

Hidden layers formulate Gibbs distributions.

03

DNNs can be interpreted as Bayesian neural networks.

Abstract

In this work, we introduce a novel probabilistic representation of deep learning, which provides an explicit explanation for the Deep Neural Networks (DNNs) in three aspects: (i) neurons define the energy of a Gibbs distribution; (ii) the hidden layers of DNNs formulate Gibbs distributions; and (iii) the whole architecture of DNNs can be interpreted as a Bayesian neural network. Based on the proposed probabilistic representation, we investigate two fundamental properties of deep learning: hierarchy and generalization. First, we explicitly formulate the hierarchy property from the Bayesian perspective, namely that some hidden layers formulate a prior distribution and the remaining layers formulate a likelihood distribution. Second, we demonstrate that DNNs have an explicit regularization by learning a prior distribution and the learning algorithm is one reason for decreasing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis