Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
Rahul Parhi, Michael Unser

TL;DR
This paper explores the function-space optimality of shallow neural networks with multivariate nonlinearities, introducing a new Banach space framework that characterizes optimal architectures with skip connections and links to kernel methods.
Contribution
It constructs a novel Banach space framework using the $k$-plane transform and sparsity norms, providing a representer theorem for neural architectures with multivariate nonlinearities.
Findings
Characterizes neural architectures via a new Banach space framework.
Shows compatibility with classical nonlinearities like ReLU and radial basis functions.
Connects the function spaces to reproducing kernel Banach spaces and variation spaces.
Abstract
We investigate the function-space optimality (specifically, the Banach-space optimality) of a large class of shallow neural architectures with multivariate nonlinearities/activation functions. To that end, we construct a new family of Banach spaces defined via a regularization operator, the -plane transform, and a sparsity-promoting norm. We prove a representer theorem that states that the solution sets to learning problems posed over these Banach spaces are completely characterized by neural architectures with multivariate nonlinearities. These optimal architectures have skip connections and are tightly connected to orthogonal weight normalization and multi-index models, both of which have received recent interest in the neural network community. Our framework is compatible with a number of classical nonlinearities including the rectified linear unit (ReLU) activation function, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Machine Fault Diagnosis Techniques
MethodsWeight Normalization
