Identifiability of Deep Polynomial Neural Networks
Konstantin Usevich, Ricardo Borsoi, Clara D\'erand, Marianne Clausel

TL;DR
This paper analyzes the conditions under which deep Polynomial Neural Networks are identifiable, linking algebraic properties with network architecture and activation degrees to enhance interpretability.
Contribution
It provides a comprehensive, constructive analysis of deep PNN identifiability, including new bounds and resolution of an open conjecture.
Findings
Identifiability depends on activation degrees and layer widths.
Non-increasing layer widths lead to generic identifiability.
Encoder-decoder networks are identifiable under certain width constraints.
Abstract
Polynomial Neural Networks (PNNs) possess a rich algebraic and geometric structure. However, their identifiability -- a key property for ensuring interpretability -- remains poorly understood. In this work, we present a comprehensive analysis of the identifiability of deep PNNs, including architectures with and without bias terms. Our results reveal an intricate interplay between activation degrees and layer widths in achieving identifiability. As special cases, we show that architectures with non-increasing layer widths are generically identifiable under mild conditions, while encoder-decoder networks are identifiable when the decoder widths do not grow too rapidly compared to the activation degrees. Our proofs are constructive and center on a connection between deep PNNs and low-rank tensor decompositions, and Kruskal-type uniqueness theorems. We also settle an open conjecture on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
