Hessian Eigenvectors and Principal Component Analysis of Neural Network Weight Matrices
David Haink

TL;DR
This paper investigates the relationship between Hessian eigenvectors and neural network weights, revealing how principal component analysis and singular value decomposition can identify critical parameter directions to improve understanding and mitigate catastrophic forgetting.
Contribution
It introduces a novel analysis linking Hessian eigenvectors with network weights and demonstrates how PCA and SVD can identify important directions to address catastrophic forgetting.
Findings
Hessian eigenvectors correlate with network weights based on eigenvalue magnitude.
Principal component analysis effectively approximates the Hessian, especially using update parameters.
Deeper layers exhibit larger Hessian eigenvalues, indicating higher curvature.
Abstract
This study delves into the intricate dynamics of trained deep neural networks and their relationships with network parameters. Trained networks predominantly continue training in a single direction, known as the drift mode. This drift mode can be explained by the quadratic potential model of the loss function, suggesting a slow exponential decay towards the potential minima. We unveil a correlation between Hessian eigenvectors and network weights. This relationship, hinging on the magnitude of eigenvalues, allows us to discern parameter directions within the network. Notably, the significance of these directions relies on two defining attributes: the curvature of their potential wells (indicated by the magnitude of Hessian eigenvalues) and their alignment with the weight vectors. Our exploration extends to the decomposition of weight matrices through singular value decomposition. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · COVID-19 diagnosis using AI · Brain Tumor Detection and Classification
MethodsExponential Decay
