A General-Purpose Transferable Predictor for Neural Architecture Search
Fred X. Han, Keith G. Mills, Fabian Chudak, Parsa Riahi, Mohammad, Salameh, Jialin Zhang, Wei Lu, Shangling Jui, Di Niu

TL;DR
This paper introduces a universal neural performance predictor for NAS that leverages graph representations and contrastive learning to transfer across search spaces, outperforming existing methods and enabling effective architecture search.
Contribution
We propose a general-purpose NAS predictor using computation graph representations and contrastive learning, enabling transferability across diverse search spaces.
Findings
Achieves high SRCC on NAS-Bench-101, 201, and 301.
Outperforms several zero-cost proxies in predicting performance.
Successfully finds high-accuracy architectures, including MobileNetV3 on ImageNet.
Abstract
Understanding and modelling the performance of neural architectures is key to Neural Architecture Search (NAS). Performance predictors have seen widespread use in low-cost NAS and achieve high ranking correlations between predicted and ground truth performance in several NAS benchmarks. However, existing predictors are often designed based on network encodings specific to a predefined search space and are therefore not generalizable to other search spaces or new architecture families. In this paper, we propose a general-purpose neural predictor for NAS that can transfer across search spaces, by representing any given candidate Convolutional Neural Network (CNN) with a Computation Graph (CG) that consists of primitive operators. We further combine our CG network representation with Contrastive Learning (CL) and propose a graph representation learning procedure that leverages the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
MethodsDepthwise Convolution · Sigmoid Activation · Average Pooling · Pointwise Convolution · ReLU6 · Depthwise Separable Convolution · Batch Normalization · Dense Connections · Squeeze-and-Excitation Block · 1x1 Convolution
