Stably unactivated neurons in ReLU neural networks
Natalie Brownlowe, Christopher R. Cornwell, Ethan Montes, Gabriel, Quijano, Grace Stulman, Na Zhang

TL;DR
This paper analyzes the probability of neurons remaining stably unactivated in ReLU neural networks, providing exact formulas for certain layer sizes and proposing a conjecture supported by computational evidence.
Contribution
It derives exact probabilities for stably unactivated neurons in the second hidden layer under specific conditions and introduces a conjecture for more complex cases.
Findings
Exact probability formulas for specific layer sizes.
A conjecture for cases with more neurons than input dimension.
Computational evidence supporting the conjecture.
Abstract
The choice of architecture of a neural network influences which functions will be realizable by that neural network and, as a result, studying the expressiveness of a chosen architecture has received much attention. In ReLU neural networks, the presence of stably unactivated neurons can reduce the network's expressiveness. In this work, we investigate the probability of a neuron in the second hidden layer of such neural networks being stably unactivated when the weights and biases are initialized from symmetric probability distributions. For networks with input dimension , we prove that if the first hidden layer has neurons then this probability is exactly , and if the first hidden layer has neurons, , then the probability is . Finally, for the case when the first hidden layer has more neurons than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia?
