Capacity of the covariance perceptron
David Dahmen, Matthieu Gilson, Moritz Helias

TL;DR
This paper introduces the covariance perceptron, a neural network model that classifies based on covariances of time series, extending classical perceptron theory and demonstrating higher capacity and information storage in high-dimensional input scenarios.
Contribution
It extends Gardner's theory to bilinear covariance mappings, deriving capacity formulas and showing the covariance perceptron outperforms the classical perceptron in certain high-dimensional tasks.
Findings
Covariance perceptron has higher pattern capacity than classical perceptron.
It stores more information due to larger input-output space.
Theoretical results are validated with numerical experiments.
Abstract
The classical perceptron is a simple neural network that performs a binary classification by a linear mapping between static inputs and outputs and application of a threshold. For small inputs, neural networks in a stationary state also perform an effectively linear input-output transformation, but of an entire time series. Choosing the temporal mean of the time series as the feature for classification, the linear transformation of the network with subsequent thresholding is equivalent to the classical perceptron. Here we show that choosing covariances of time series as the feature for classification maps the neural network to what we call a 'covariance perceptron'; a mapping between covariances that is bilinear in terms of weights. By extending Gardner's theory of connections to this bilinear problem, using a replica symmetric mean-field theory, we compute the pattern and information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
