Avoiding pathologies in very deep networks

David Duvenaud; Oren Rippel; Ryan P. Adams; Zoubin Ghahramani

arXiv:1402.5836·stat.ML·July 12, 2016·86 cites

Avoiding pathologies in very deep networks

David Duvenaud, Oren Rippel, Ryan P. Adams, Zoubin Ghahramani

PDF

Open Access 2 Repos

TL;DR

This paper investigates the limitations of deep Gaussian processes and proposes alternative architectures to prevent capacity collapse, enhancing the understanding of deep network regularization and design.

Contribution

It identifies a pathology in standard deep Gaussian processes and introduces an architecture that maintains representational capacity across many layers.

Findings

01

Standard architectures tend to lose degrees of freedom with depth.

02

Proposed architecture avoids capacity collapse in deep networks.

03

Analyzed deep covariance functions and dropout effects on Gaussian processes.

Abstract

Choosing appropriate architectures and regularization strategies for deep networks is crucial to good predictive performance. To shed light on this problem, we analyze the analogous problem of constructing useful priors on compositions of functions. Specifically, we study the deep Gaussian process, a type of infinitely-wide, deep neural network. We show that in standard architectures, the representational capacity of the network tends to capture fewer degrees of freedom as the number of layers increases, retaining only a single degree of freedom in the limit. We propose an alternate network architecture which does not suffer from this pathology. We also examine deep covariance functions, obtained by composing infinitely many feature transforms. Lastly, we characterize the class of models obtained by performing dropout on Gaussian processes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Control Systems and Identification · Neural Networks and Applications