On the validity of kernel approximations for orthogonally-initialized neural networks
James Martens

TL;DR
This paper extends kernel approximation results from Gaussian to orthogonally-initialized neural networks using Haar-distributed matrices, leveraging recent random matrix theory insights.
Contribution
It introduces a novel extension of kernel approximation analysis to orthogonal initializations, broadening understanding of neural network behavior.
Findings
Kernel approximation results hold for orthogonally-initialized networks.
Uses random matrix theory to establish theoretical guarantees.
Extends prior Gaussian-based analyses to Haar-distributed orthogonal matrices.
Abstract
In this note we extend kernel function approximation results for neural networks with Gaussian-distributed weights to single-layer networks initialized using Haar-distributed random orthogonal matrices (with possible rescaling). This is accomplished using recent results from random matrix theory.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Matrix Theory and Algorithms · Neural Networks and Applications
