The Separation Capacity of Random Neural Networks
Sjoerd Dirksen, Martin Genzel, Laurent Jacques, Alexander Stollenwerk

TL;DR
This paper provides a theoretical analysis of the separation capacity of random neural networks, showing that with enough neurons, they can linearly separate classes based on geometric data properties, overcoming high-dimensional challenges.
Contribution
It introduces an explicit link between the number of neurons needed and the geometric complexity of data, advancing understanding of random neural networks' separation abilities.
Findings
Random two-layer ReLU networks can linearly separate classes with high probability.
The required number of neurons depends on data geometry and mutual complexity.
The approach overcomes the curse of dimensionality in low-complexity data scenarios.
Abstract
Neural networks with random weights appear in a variety of machine learning applications, most prominently as the initialization of many deep learning algorithms and as a computationally cheap alternative to fully learned neural networks. In the present article, we enhance the theoretical understanding of random neural networks by addressing the following data separation problem: under what conditions can a random neural network make two classes (with positive distance) linearly separable? We show that a sufficiently large two-layer ReLU-network with standard Gaussian weights and uniformly distributed biases can solve this problem with high probability. Crucially, the number of required neurons is explicitly linked to geometric properties of the underlying sets and their mutual arrangement. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Complexity and Algorithms in Graphs
