An Embedding of ReLU Networks and an Analysis of their Identifiability

Pierre Stock; Rémi Gribonval

arXiv:2107.09370 · cs.LG · June 8, 2022

Open Access

TL;DR

This paper introduces an embedding for ReLU neural networks that is invariant to scalings and gives a locally linear parameterization of the network's realization, enabling analysis of local identifiability from the realization on a finite set of samples.

Contribution

The paper proposes a novel invariant embedding for ReLU networks and derives conditions for their local identifiability from finite data samples.

Findings

01 The embedding is invariant to scalings and provides a locally linear parameterization of the realization.

02 Conditions for local identifiability are established.

03 Identifiability criteria are characterized for shallow networks.

Abstract

Neural networks with the Rectified Linear Unit (ReLU) nonlinearity are described by a vector of parameters $\theta$, and realized as a piecewise linear continuous function $R_{\theta} : x \in \mathbb{R}^{d} \mapsto R_{\theta}(x) \in \mathbb{R}^{k}$. Natural scaling and permutation operations on the parameters $\theta$ leave the realization unchanged, leading to equivalence classes of parameters that yield the same realization. These considerations in turn lead to the notion of identifiability: the ability to recover (the equivalence class of) $\theta$ from the sole knowledge of its realization $R_{\theta}$. The overall objective of this paper is to introduce an embedding for ReLU neural networks of any depth, $\Phi(\theta)$, that is invariant to scalings and that provides a locally linear parameterization of the realization of the network. Leveraging these two key properties, we derive…
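The scaling and permutation invariances mentioned in the abstract can be checked numerically on a toy network. The following is a minimal sketch, assuming a one-hidden-layer network $R_{\theta}(x) = W_2\,\mathrm{ReLU}(W_1 x + b_1) + b_2$; the function names (`realize`, `rescale`, `permute`, `path_products`), the layer sizes, and the scaling factors are all hypothetical choices made for illustration. The `path_products` helper computes products of weights along input-hidden-output paths, which is one simple quantity that is unchanged by positive rescalings; it is shown only as an example of a scaling-invariant quantity and is not claimed to be the paper's $\Phi(\theta)$.

```python
import random

def relu(v):
    return [max(0.0, x) for x in v]

def matvec(W, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def realize(W1, b1, W2, b2, x):
    """R_theta(x) = W2 relu(W1 x + b1) + b2 for a one-hidden-layer network."""
    h = relu([z + b for z, b in zip(matvec(W1, x), b1)])
    return [z + b for z, b in zip(matvec(W2, h), b2)]

def rescale(W1, b1, W2, b2, lambdas):
    """Multiply hidden neuron j's incoming weights and bias by lambda_j > 0
    and divide its outgoing weights by lambda_j."""
    W1s = [[lam * w for w in row] for row, lam in zip(W1, lambdas)]
    b1s = [lam * b for b, lam in zip(b1, lambdas)]
    W2s = [[w / lam for w, lam in zip(row, lambdas)] for row in W2]
    return W1s, b1s, W2s, list(b2)

def permute(W1, b1, W2, b2, perm):
    """Reorder the hidden neurons consistently in both layers."""
    W1p = [W1[j] for j in perm]
    b1p = [b1[j] for j in perm]
    W2p = [[row[j] for j in perm] for row in W2]
    return W1p, b1p, W2p, list(b2)

def path_products(W1, W2):
    """Flat list of W2[k][j] * W1[j][i] over all input-hidden-output paths.
    Illustrative scaling-invariant quantity, not the paper's embedding."""
    return [W2[k][j] * W1[j][i]
            for k in range(len(W2))
            for j in range(len(W1))
            for i in range(len(W1[0]))]

def max_gap(theta_a, theta_b, xs):
    """Largest output difference between two parameter vectors over xs."""
    return max(abs(a - b)
               for x in xs
               for a, b in zip(realize(*theta_a, x), realize(*theta_b, x)))

rng = random.Random(0)
d, m, k = 3, 4, 2  # input, hidden, and output widths (arbitrary)
W1 = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(m)]
b1 = [rng.uniform(-1, 1) for _ in range(m)]
W2 = [[rng.uniform(-1, 1) for _ in range(m)] for _ in range(k)]
b2 = [rng.uniform(-1, 1) for _ in range(k)]
theta = (W1, b1, W2, b2)
xs = [[rng.uniform(-2, 2) for _ in range(d)] for _ in range(50)]

theta_scaled = rescale(*theta, lambdas=[0.5, 2.0, 3.0, 10.0])
theta_perm = permute(*theta, perm=[2, 0, 3, 1])

# Both gaps should be zero up to floating-point rounding.
scale_gap = max_gap(theta, theta_scaled, xs)
perm_gap = max_gap(theta, theta_perm, xs)

# Path products agree before and after rescaling, up to rounding.
path_gap = max(abs(a - b) for a, b in zip(
    path_products(W1, W2),
    path_products(theta_scaled[0], theta_scaled[2])))

print(scale_gap, perm_gap, path_gap)
```

The rescaling relies on the positive homogeneity of ReLU, $\mathrm{ReLU}(\lambda z) = \lambda\,\mathrm{ReLU}(z)$ for $\lambda > 0$, so the sketch only uses positive factors. The three parameter vectors differ entry by entry yet realize the same function on the sampled inputs, which is why identifiability can only be asked for up to an equivalence class.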

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Neural Networks and Applications