Volume-preserving Neural Networks

Gordon MacDonald; Andrew Godbout; Bryn Gillcash; Stephanie; Cairns

arXiv:1911.09576·cs.LG·April 27, 2021

Volume-preserving Neural Networks

Gordon MacDonald, Andrew Godbout, Bryn Gillcash, Stephanie, Cairns

PDF

1 Repo

TL;DR

This paper introduces a volume-preserving neural network architecture that addresses the vanishing and exploding gradient problems by ensuring all layers maintain volume, leading to more stable training.

Contribution

The authors develop a novel neural network architecture with volume-preserving layers, combining rotation, permutation, diagonal, and a new coupled activation function.

Findings

01

Successfully applied to two standard datasets

02

Maintains stable gradients during training

03

Reduces vanishing and exploding gradient issues

Abstract

We propose a novel approach to addressing the vanishing (or exploding) gradient problem in deep neural networks. We construct a new architecture for deep neural networks where all layers (except the output layer) of the network are a combination of rotation, permutation, diagonal, and activation sublayers which are all volume preserving. Our approach replaces the standard weight matrix of a neural network with a combination of diagonal, rotational and permutation matrices, all of which are volume-preserving. We introduce a coupled activation function allowing us to preserve volume even in the activation function portion of a neural network layer. This control on the volume forces the gradient (on average) to maintain equilibrium and not explode or vanish. To demonstrate our architecture we apply our volume-preserving neural network model to two standard datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrewgodbout/VPNN_pytorch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.