NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance
Raphael T. Husistein, Markus Reiher, and Marco Eckhoff

TL;DR
NEAR is a zero-cost, training-free proxy that estimates neural network performance based on activation rank, enabling efficient architecture selection and hyperparameter tuning without training models.
Contribution
This work introduces NEAR, a novel zero-cost proxy based on activation rank, for predicting neural network performance and optimizing architecture and hyperparameters without training.
Findings
Strong correlation between NEAR score and model accuracy.
Effective in estimating optimal layer sizes in MLPs.
Useful for hyperparameter selection like activation functions and initialization.
Abstract
Artificial neural networks have been shown to be state-of-the-art machine learning models in a wide variety of applications, including natural language processing and image recognition. However, building a performant neural network is a laborious task and requires substantial computing power. Neural Architecture Search (NAS) addresses this issue by an automatic selection of the optimal network from a set of potential candidates. While many NAS methods still require training of (some) neural networks, zero-cost proxies promise to identify the optimal network without training. In this work, we propose the zero-cost proxy \textit{Network Expressivity by Activation Rank} (NEAR). It is based on the effective rank of the pre- and post-activation matrix, i.e., the values of a neural network layer before and after applying its activation function. We demonstrate the cutting-edge correlation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Stock Market Forecasting Methods
MethodsSparse Evolutionary Training
