Stably unactivated neurons in ReLU neural networks

Natalie Brownlowe; Christopher R. Cornwell; Ethan Montes; Gabriel; Quijano; Grace Stulman; Na Zhang

arXiv:2412.06829·cs.LG·December 18, 2024

Stably unactivated neurons in ReLU neural networks

Natalie Brownlowe, Christopher R. Cornwell, Ethan Montes, Gabriel, Quijano, Grace Stulman, Na Zhang

PDF

Open Access

TL;DR

This paper analyzes the probability of neurons remaining stably unactivated in ReLU neural networks, providing exact formulas for certain layer sizes and proposing a conjecture supported by computational evidence.

Contribution

It derives exact probabilities for stably unactivated neurons in the second hidden layer under specific conditions and introduces a conjecture for more complex cases.

Findings

01

Exact probability formulas for specific layer sizes.

02

A conjecture for cases with more neurons than input dimension.

03

Computational evidence supporting the conjecture.

Abstract

The choice of architecture of a neural network influences which functions will be realizable by that neural network and, as a result, studying the expressiveness of a chosen architecture has received much attention. In ReLU neural networks, the presence of stably unactivated neurons can reduce the network's expressiveness. In this work, we investigate the probability of a neuron in the second hidden layer of such neural networks being stably unactivated when the weights and biases are initialized from symmetric probability distributions. For networks with input dimension $n_{0}$ , we prove that if the first hidden layer has $n_{0} + 1$ neurons then this probability is exactly $\frac{2 ^{n_{0}} + 1}{4 ^{n_{0} + 1}}$ , and if the first hidden layer has $n_{1}$ neurons, $n_{1} \leq n_{0}$ , then the probability is $\frac{1}{2 ^{n_{1} + 1}}$ . Finally, for the case when the first hidden layer has more neurons than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia?