A Complete Characterisation of ReLU-Invariant Distributions
Jan Macdonald, Stephan W\"aldchen

TL;DR
This paper fully characterizes the families of probability distributions invariant under ReLU neural network layers, revealing that practical invariant families are impossible without restrictive conditions.
Contribution
It proves that non-trivial ReLU-invariant distribution families cannot exist unless they are trivial, finite, or non-Lipschitz, and constructs examples for each case.
Findings
No non-trivial invariant families exist under practical conditions.
Invariant families require restrictive assumptions such as finite support or non-Lipschitz parametrization.
Constructed explicit examples for each restrictive case.
Abstract
We give a complete characterisation of families of probability distributions that are invariant under the action of ReLU neural network layers. The need for such families arises during the training of Bayesian networks or the analysis of trained neural networks, e.g., in the context of uncertainty quantification (UQ) or explainable artificial intelligence (XAI). We prove that no invariant parametrised family of distributions can exist unless at least one of the following three restrictions holds: First, the network layers have a width of one, which is unreasonable for practical neural networks. Second, the probability measures in the family have finite support, which basically amounts to sampling distributions. Third, the parametrisation of the family is not locally Lipschitz continuous, which excludes all computationally feasible families. Finally, we show that these restrictions are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference
