Approximate Spectral Decomposition of Fisher Information Matrix for Simple ReLU Networks
Yoshinari Takeishi, Masazumi Iida, Jun'ichi Takeuchi

TL;DR
This paper provides an approximate spectral decomposition of the Fisher Information Matrix for simple ReLU networks, revealing the structure of its eigenvalues and eigenvectors under certain conditions.
Contribution
It offers a novel theoretical characterization of the FIM's eigenstructure for one hidden layer ReLU networks, which was previously not well-understood.
Findings
Eigenvalues cluster into three groups with specific eigenvectors.
The largest eigenvalue corresponds to the Perron-Frobenius eigenvalue.
Eigenvectors of the second cluster span the row space of W.
Abstract
We argue the Fisher information matrix (FIM) of one hidden layer networks with the ReLU activation function. For a network, let denote the weight matrix from the -dimensional input to the hidden layer consisting of neurons, and the -dimensional weight vector from the hidden layer to the scalar output. We focus on the FIM of , which we denote as . Under certain conditions, we characterize the first three clusters of eigenvalues and eigenvectors of the FIM. Specifically, we show that 1) Since is non-negative owing to the ReLU, the first eigenvalue is the Perron-Frobenius eigenvalue. 2) For the cluster of the next maximum values, the eigenspace is spanned by the row vectors of . 3) The direct sum of the eigenspace of the first eigenvalue and that of the third cluster is spanned by the set of all the vectors obtained as the Hadamard product of any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques
