A Complete Characterisation of ReLU-Invariant Distributions

Jan Macdonald; Stephan W\"aldchen

arXiv:2112.06532·cs.LG·December 14, 2021

A Complete Characterisation of ReLU-Invariant Distributions

Jan Macdonald, Stephan W\"aldchen

PDF

Open Access

TL;DR

This paper fully characterizes the families of probability distributions invariant under ReLU neural network layers, revealing that practical invariant families are impossible without restrictive conditions.

Contribution

It proves that non-trivial ReLU-invariant distribution families cannot exist unless they are trivial, finite, or non-Lipschitz, and constructs examples for each case.

Findings

01

No non-trivial invariant families exist under practical conditions.

02

Invariant families require restrictive assumptions such as finite support or non-Lipschitz parametrization.

03

Constructed explicit examples for each restrictive case.

Abstract

We give a complete characterisation of families of probability distributions that are invariant under the action of ReLU neural network layers. The need for such families arises during the training of Bayesian networks or the analysis of trained neural networks, e.g., in the context of uncertainty quantification (UQ) or explainable artificial intelligence (XAI). We prove that no invariant parametrised family of distributions can exist unless at least one of the following three restrictions holds: First, the network layers have a width of one, which is unreasonable for practical neural networks. Second, the probability measures in the family have finite support, which basically amounts to sampling distributions. Third, the parametrisation of the family is not locally Lipschitz continuous, which excludes all computationally feasible families. Finally, we show that these restrictions are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference