Neural Architecture Search: Two Constant Shared Weights Initialisations

Ekaterina Gracheva

arXiv:2302.04406·cs.LG·July 8, 2025

Neural Architecture Search: Two Constant Shared Weights Initialisations

Ekaterina Gracheva

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces epsinas, a fast, zero-cost neural architecture evaluation metric based on two constant shared weight initialisations, which correlates strongly with trained accuracy across various tasks and datasets.

Contribution

The paper proposes epsinas, a novel zero-cost NAS metric using constant shared weight initialisations and output statistics, requiring no training or labels, and enabling rapid architecture evaluation.

Findings

01

Strong correlation with trained accuracy across tasks

02

Operates in a fraction of a GPU second

03

No need for data labels or gradient computation

Abstract

In the last decade, zero-cost metrics have gained prominence in neural architecture search (NAS) due to their ability to evaluate architectures without training. These metrics are significantly faster and less computationally expensive than traditional NAS methods and provide insights into neural architectures' internal workings. This paper introduces epsinas, a novel zero-cost NAS metric that assesses architecture potential using two constant shared weight initialisations and the statistics of their outputs. We show that the dispersion of raw outputs, normalised by their average magnitude, strongly correlates with trained accuracy. This effect holds across image classification and language tasks on NAS-Bench-101, NAS-Bench-201, and NAS-Bench-NLP. Our method requires no data labels, operates on a single minibatch, and eliminates the need for gradient computation, making it independent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

egracheva/epsinas
pytorchOfficial

Datasets

egracheva/epsinas-release-data
dataset· 159 dl
159 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Neural Networks and Applications