Random deep neural networks are biased towards simple functions

Giacomo De Palma; Bobak Toussi Kiani; Seth Lloyd

arXiv:1812.10156·stat.ML·December 4, 2019·31 cites

Random deep neural networks are biased towards simple functions

Giacomo De Palma, Bobak Toussi Kiani, Seth Lloyd

PDF

Open Access 1 Repo

TL;DR

This paper proves that random wide deep neural networks with ReLU activation tend to produce simple functions, which helps explain their strong generalization properties and provides insights into their prior distribution in PAC-Bayesian bounds.

Contribution

It establishes that random deep neural networks are biased towards simple functions, confirming a conjecture and advancing understanding of their generalization behavior.

Findings

01

Average Hamming distance to different classification is at least sqrt(n / (2π log n))

02

Number of flips to change classification grows linearly with n

03

Results confirmed by numerical experiments on networks with two hidden layers

Abstract

We prove that the binary classifiers of bit strings generated by random wide deep neural networks with ReLU activation function are biased towards simple functions. The simplicity is captured by the following two properties. For any given input bit string, the average Hamming distance of the closest input bit string with a different classification is at least sqrt(n / (2{\pi} log n)), where n is the length of the string. Moreover, if the bits of the initial string are flipped randomly, the average number of flips required to change the classification grows linearly with n. These results are confirmed by numerical experiments on deep neural networks with two hidden layers, and settle the conjecture stating that random deep neural networks are biased towards simple functions. This conjecture was proposed and numerically explored in [Valle P\'erez et al., ICLR 2019] to explain the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bkiani/random_NN_simplicity
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Machine Learning and Data Classification

Methods*Communicated@Fast*How Do I Communicate to Expedia?