On the validity of kernel approximations for orthogonally-initialized   neural networks

James Martens

arXiv:2104.05878·cs.LG·April 14, 2021·1 cites

On the validity of kernel approximations for orthogonally-initialized neural networks

James Martens

PDF

Open Access

TL;DR

This paper extends kernel approximation results from Gaussian to orthogonally-initialized neural networks using Haar-distributed matrices, leveraging recent random matrix theory insights.

Contribution

It introduces a novel extension of kernel approximation analysis to orthogonal initializations, broadening understanding of neural network behavior.

Findings

01

Kernel approximation results hold for orthogonally-initialized networks.

02

Uses random matrix theory to establish theoretical guarantees.

03

Extends prior Gaussian-based analyses to Haar-distributed orthogonal matrices.

Abstract

In this note we extend kernel function approximation results for neural networks with Gaussian-distributed weights to single-layer networks initialized using Haar-distributed random orthogonal matrices (with possible rescaling). This is accomplished using recent results from random matrix theory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Matrix Theory and Algorithms · Neural Networks and Applications