Neural Architecture Search without Training

Joseph Mellor; Jack Turner; Amos Storkey; Elliot J. Crowley

arXiv:2006.04647·cs.LG·June 14, 2021·44 cites

Neural Architecture Search without Training

Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces a novel NAS method that predicts network performance from untrained activation overlaps, enabling rapid architecture search without training, significantly reducing computational costs.

Contribution

It proposes a new measure based on activation overlap in untrained networks to predict trained accuracy, facilitating fast NAS without training.

Findings

01

Search can be performed in seconds on a single GPU.

02

The method is effective across multiple NAS benchmarks.

03

It can be combined with existing search algorithms.

Abstract

The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network's trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, NATS-Bench, and Network Design Spaces. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Neural Architecture Search without Training (Paper Explained)· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

MethodsSigmoid Activation · Softmax · Tanh Activation · Long Short-Term Memory