Large deviation principles for convolutional Bayesian neural networks
Federico Bassetti, Vassili De Palma, Lucia Ladelli

TL;DR
This paper establishes the first large deviation principle for convolutional neural networks, analyzing their behavior beyond the Gaussian limit and providing insights into their probabilistic properties in the infinite-channel regime.
Contribution
It introduces a large deviation principle for CNNs with Gaussian priors, extending understanding beyond the Gaussian process limit and including posterior distributions conditioned on data.
Findings
LDP for the sequence of covariance matrices in CNNs
LDP for the posterior distribution given observations
Streamlined proof of covariance concentration and Gaussian equivalence
Abstract
While suitably scaled CNNs with Gaussian initialization are known to converge to Gaussian processes as the number of channels diverges, little is known beyond this Gaussian limit. We establish a large deviation principle (LDP) for convolutional neural networks in the infinite-channel regime. We consider a broad class of multidimensional CNN architectures characterized by general receptive fields encoded through a patch-extractor function satisfying mild structural assumptions. Our main result establishes a large deviation principle (LDP) for the sequence of conditional covariance matrices under Gaussian prior distribution on the weights. We further derive an LDP for the posterior distribution obtained by conditioning on a finite number of observations. In addition, we provide a streamlined proof of the concentration of the conditional covariances and of the Gaussian equivalence of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference
