GENNAPE: Towards Generalized Neural Architecture Performance Estimators

Keith G. Mills; Fred X. Han; Jialin Zhang; Fabian Chudak; Ali Safari; Mamaghani; Mohammad Salameh; Wei Lu; Shangling Jui; Di Niu

arXiv:2211.17226·cs.LG·July 6, 2023

GENNAPE: Towards Generalized Neural Architecture Performance Estimators

Keith G. Mills, Fred X. Han, Jialin Zhang, Fabian Chudak, Ali Safari, Mamaghani, Mohammad Salameh, Wei Lu, Shangling Jui, Di Niu

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

GENNAPE is a novel neural architecture performance estimator that generalizes across unseen architectures using contrastive learning and fuzzy clustering, outperforming existing predictors in transferability and aiding efficient neural architecture search.

Contribution

It introduces a generalized, pretrained performance estimator for neural networks that effectively models unseen architectures through graph encoding, contrastive pretraining, and ensemble predictors.

Findings

01

GENNAPE pretrained on NAS-Bench-101 transfers well to other benchmarks.

02

It accurately identifies high-performance architectures in new benchmarks.

03

It improves architecture search results by balancing accuracy and FLOPs.

Abstract

Predicting neural architecture performance is a challenging task and is crucial to neural architecture design and search. Existing approaches either rely on neural performance predictors which are limited to modeling architectures in a predefined design space involving specific sets of operators and connection rules, and cannot generalize to unseen architectures, or resort to zero-cost proxies which are not always accurate. In this paper, we propose GENNAPE, a Generalized Neural Architecture Performance Estimator, which is pretrained on open neural architecture benchmarks, and aims to generalize to completely unseen architectures through combined innovations in network representation, contrastive pretraining, and fuzzy clustering-based predictor ensemble. Specifically, GENNAPE represents a given neural network as a Computation Graph (CG) of atomic operations which can model an arbitrary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Ascend-Research/GENNAPE
pytorchOfficial

Datasets

kgmills/GENNAPE
dataset· 24 dl
24 dl

Videos

GENNAPE: Towards Generalized Neural Architecture Performance Estimators· underline

Taxonomy

TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Ferroelectric and Negative Capacitance Devices

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · Global Average Pooling · Residual Connection · Bottleneck Residual Block · Batch Normalization · Kaiming Initialization · Max Pooling · Residual Block