GENNAPE: Towards Generalized Neural Architecture Performance Estimators
Keith G. Mills, Fred X. Han, Jialin Zhang, Fabian Chudak, Ali Safari, Mamaghani, Mohammad Salameh, Wei Lu, Shangling Jui, Di Niu

TL;DR
GENNAPE is a novel neural architecture performance estimator that generalizes across unseen architectures using contrastive learning and fuzzy clustering, outperforming existing predictors in transferability and aiding efficient neural architecture search.
Contribution
It introduces a generalized, pretrained performance estimator for neural networks that effectively models unseen architectures through graph encoding, contrastive pretraining, and ensemble predictors.
Findings
GENNAPE pretrained on NAS-Bench-101 transfers well to other benchmarks.
It accurately identifies high-performance architectures in new benchmarks.
It improves architecture search results by balancing accuracy and FLOPs.
Abstract
Predicting neural architecture performance is a challenging task and is crucial to neural architecture design and search. Existing approaches either rely on neural performance predictors which are limited to modeling architectures in a predefined design space involving specific sets of operators and connection rules, and cannot generalize to unseen architectures, or resort to zero-cost proxies which are not always accurate. In this paper, we propose GENNAPE, a Generalized Neural Architecture Performance Estimator, which is pretrained on open neural architecture benchmarks, and aims to generalize to completely unseen architectures through combined innovations in network representation, contrastive pretraining, and fuzzy clustering-based predictor ensemble. Specifically, GENNAPE represents a given neural network as a Computation Graph (CG) of atomic operations which can model an arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Ferroelectric and Negative Capacitance Devices
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · Global Average Pooling · Residual Connection · Bottleneck Residual Block · Batch Normalization · Kaiming Initialization · Max Pooling · Residual Block
