Why are deep nets reversible: A simple theory, with implications for   training

Sanjeev Arora; Yingyu Liang; Tengyu Ma

arXiv:1511.05653·cs.LG·November 23, 2015·41 cites

Why are deep nets reversible: A simple theory, with implications for training

Sanjeev Arora, Yingyu Liang, Tengyu Ma

PDF

Open Access

TL;DR

This paper introduces a simple, theoretically justified generative model for ReLU deep nets, demonstrating that the reverse of the feedforward transformation can recover hidden layers and improve training with synthetic data.

Contribution

It provides a simple, provably correct generative model for deep nets based on weight transposition, supported by experiments on real networks like AlexNet.

Findings

01

The reverse of the feedforward transformation can recover hidden layers.

02

The generative model improves training by augmenting data with synthetic samples.

03

The theory holds under the assumption that weights behave like random variables.

Abstract

Generative models for deep learning are promising both to improve understanding of the model, and yield training methods requiring fewer labeled samples. Recent works use generative model approaches to produce the deep net's input given the value of a hidden layer several levels above. However, there is no accompanying "proof of correctness" for the generative model, showing that the feedforward deep net is the correct inference method for recovering the hidden layer given the input. Furthermore, these models are complicated. The current paper takes a more theoretical tack. It presents a very simple generative model for RELU deep nets, with the following characteristics: (i) The generative model is just the reverse of the feedforward net: if the forward transformation at a layer is $A$ then the reverse transformation is $A^{T}$ . (This can be seen as an explanation of the old weight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Neural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia?