Random deep neural networks are biased towards simple functions
Giacomo De Palma, Bobak Toussi Kiani, Seth Lloyd

TL;DR
This paper proves that random wide deep neural networks with ReLU activation tend to produce simple functions, which helps explain their strong generalization properties and provides insights into their prior distribution in PAC-Bayesian bounds.
Contribution
It establishes that random deep neural networks are biased towards simple functions, confirming a conjecture and advancing understanding of their generalization behavior.
Findings
Average Hamming distance to different classification is at least sqrt(n / (2π log n))
Number of flips to change classification grows linearly with n
Results confirmed by numerical experiments on networks with two hidden layers
Abstract
We prove that the binary classifiers of bit strings generated by random wide deep neural networks with ReLU activation function are biased towards simple functions. The simplicity is captured by the following two properties. For any given input bit string, the average Hamming distance of the closest input bit string with a different classification is at least sqrt(n / (2{\pi} log n)), where n is the length of the string. Moreover, if the bits of the initial string are flipped randomly, the average number of flips required to change the classification grows linearly with n. These results are confirmed by numerical experiments on deep neural networks with two hidden layers, and settle the conjecture stating that random deep neural networks are biased towards simple functions. This conjecture was proposed and numerically explored in [Valle P\'erez et al., ICLR 2019] to explain the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Machine Learning and Data Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia?
