A theory of data variability in Neural Network Bayesian inference
Javed Lindner, David Dahmen, Michael Kr\"amer, Moritz Helias

TL;DR
This paper develops a field-theoretic framework to analyze the generalization properties of infinitely wide neural networks, accounting for data variability and providing exact bounds on learning curves.
Contribution
It introduces a novel field-theoretic approach that extends kernel methods to include data variability effects in neural network generalization analysis.
Findings
Data variability induces non-Gaussian effects in the model.
The formalism accurately predicts learning curves on synthetic and MNIST data.
Provides bounds for generalization performance in the infinite data limit.
Abstract
Bayesian inference and kernel methods are well established in machine learning. The neural network Gaussian process in particular provides a concept to investigate neural networks in the limit of infinitely wide hidden layers by using kernel and inference methods. Here we build upon this limit and provide a field-theoretic formalism which covers the generalization properties of infinitely wide networks. We systematically compute generalization properties of linear, non-linear, and deep non-linear networks for kernel matrices with heterogeneous entries. In contrast to currently employed spectral methods we derive the generalization properties from the statistical properties of the input, elucidating the interplay of input dimensionality, size of the training data set, and variability of the data. We show that data variability leads to a non-Gaussian action reminiscent of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Face and Expression Recognition · Neural Networks and Applications
MethodsGaussian Process
