Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias
Yuanzhe Hu, Kinshuk Goel, Vlad Killiakov, Yaoqing Yang

TL;DR
This paper introduces FARMS, a normalization technique that reduces aspect ratio bias in eigenspectrum analysis of neural network weights, improving diagnosis and hyperparameter tuning across various models.
Contribution
We propose FARMS, a subsampling method that normalizes weight matrices to mitigate aspect ratio bias in eigenspectrum analysis of neural networks.
Findings
FARMS effectively reduces aspect ratio bias in eigenspectrum analysis.
Applying FARMS improves layer-wise hyperparameter assignment accuracy.
FARMS reduces LLaMA-7B model perplexity by 17.3% in pruning experiments.
Abstract
Diagnosing deep neural networks (DNNs) by analyzing the eigenspectrum of their weights has been an active area of research in recent years. One of the main approaches involves measuring the heavytailness of the empirical spectral densities (ESDs) of weight matrices. This analysis has been shown to provide insights to help diagnose whether a model is well-trained or undertrained, and has been used to guide training methods involving layer-wise hyperparameter assignment. In this paper, we address an often-overlooked challenge in estimating the heavytailness of these ESDs: the impact of the aspect ratio of weight matrices. We demonstrate that matrices of varying sizes (and aspect ratios) introduce a non-negligible bias in estimating the heavytailness of ESDs, leading to inaccurate model diagnosis and layer-wise hyperparameter assignment. To overcome this challenge, we propose FARMS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsPruning
