A General-Purpose Transferable Predictor for Neural Architecture Search

Fred X. Han; Keith G. Mills; Fabian Chudak; Parsa Riahi; Mohammad; Salameh; Jialin Zhang; Wei Lu; Shangling Jui; Di Niu

arXiv:2302.10835·cs.LG·April 19, 2023

A General-Purpose Transferable Predictor for Neural Architecture Search

Fred X. Han, Keith G. Mills, Fabian Chudak, Parsa Riahi, Mohammad, Salameh, Jialin Zhang, Wei Lu, Shangling Jui, Di Niu

PDF

Open Access

TL;DR

This paper introduces a universal neural performance predictor for NAS that leverages graph representations and contrastive learning to transfer across search spaces, outperforming existing methods and enabling effective architecture search.

Contribution

We propose a general-purpose NAS predictor using computation graph representations and contrastive learning, enabling transferability across diverse search spaces.

Findings

01

Achieves high SRCC on NAS-Bench-101, 201, and 301.

02

Outperforms several zero-cost proxies in predicting performance.

03

Successfully finds high-accuracy architectures, including MobileNetV3 on ImageNet.

Abstract

Understanding and modelling the performance of neural architectures is key to Neural Architecture Search (NAS). Performance predictors have seen widespread use in low-cost NAS and achieve high ranking correlations between predicted and ground truth performance in several NAS benchmarks. However, existing predictors are often designed based on network encodings specific to a predefined search space and are therefore not generalizable to other search spaces or new architecture families. In this paper, we propose a general-purpose neural predictor for NAS that can transfer across search spaces, by representing any given candidate Convolutional Neural Network (CNN) with a Computation Graph (CG) that consists of primitive operators. We further combine our CG network representation with Contrastive Learning (CL) and propose a graph representation learning procedure that leverages the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

MethodsDepthwise Convolution · Sigmoid Activation · Average Pooling · Pointwise Convolution · ReLU6 · Depthwise Separable Convolution · Batch Normalization · Dense Connections · Squeeze-and-Excitation Block · 1x1 Convolution