NEAR: A Training-Free Pre-Estimator of Machine Learning Model   Performance

Raphael T. Husistein; Markus Reiher; and Marco Eckhoff

arXiv:2408.08776·cs.LG·February 20, 2025

NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance

Raphael T. Husistein, Markus Reiher, and Marco Eckhoff

PDF

Open Access 1 Repo 1 Video

TL;DR

NEAR is a zero-cost, training-free proxy that estimates neural network performance based on activation rank, enabling efficient architecture selection and hyperparameter tuning without training models.

Contribution

This work introduces NEAR, a novel zero-cost proxy based on activation rank, for predicting neural network performance and optimizing architecture and hyperparameters without training.

Findings

01

Strong correlation between NEAR score and model accuracy.

02

Effective in estimating optimal layer sizes in MLPs.

03

Useful for hyperparameter selection like activation functions and initialization.

Abstract

Artificial neural networks have been shown to be state-of-the-art machine learning models in a wide variety of applications, including natural language processing and image recognition. However, building a performant neural network is a laborious task and requires substantial computing power. Neural Architecture Search (NAS) addresses this issue by an automatic selection of the optimal network from a set of potential candidates. While many NAS methods still require training of (some) neural networks, zero-cost proxies promise to identify the optimal network without training. In this work, we propose the zero-cost proxy \textit{Network Expressivity by Activation Rank} (NEAR). It is based on the effective rank of the pre- and post-activation matrix, i.e., the values of a neural network layer before and after applying its activation function. We demonstrate the cutting-edge correlation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

reihergroup/near
pytorchOfficial

Videos

NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance· slideslive

Taxonomy

TopicsNeural Networks and Applications · Stock Market Forecasting Methods

MethodsSparse Evolutionary Training