On Connecting Deep Trigonometric Networks with Deep Gaussian Processes: Covariance, Expressivity, and Neural Tangent Kernel
Chi-Ken Lu, Patrick Shafto

TL;DR
This paper links deep Gaussian processes with deep trigonometric networks, revealing their covariance structures, expressivity, and neural tangent kernel, and provides insights into their inference and feature learning capabilities.
Contribution
It establishes a connection between DGPs and deep trig networks using Bochner's theorem, enabling exact MAP estimation and analysis of neural tangent kernels.
Findings
Deep DGPs can be represented as deep trig networks with equivalent covariance functions.
Varying priors over network parameters corresponds to different kernels.
Finite-width deep networks deviate from the limiting kernel, affecting feature learning.
Abstract
Deep Gaussian Process (DGP) as a model prior in Bayesian learning intuitively exploits the expressive power in function composition. DGPs also offer diverse modeling capabilities, but inference is challenging because marginalization in latent function space is not tractable. With Bochner's theorem, DGP with squared exponential kernel can be viewed as a deep trigonometric network consisting of the random feature layers, sine and cosine activation units, and random weight layers. In the wide limit with a bottleneck, we show that the weight space view yields the same effective covariance functions which were obtained previously in function space. Also, varying the prior distributions over network parameters is equivalent to employing different kernels. As such, DGPs can be translated into the deep bottlenecked trig networks, with which the exact maximum a posteriori estimation can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Bayesian Methods and Mixture Models
MethodsGaussian Process
