Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets
Hayeon Lee, Eunyoung Hyung, Sung Ju Hwang

TL;DR
This paper introduces MetaD2A, a meta-learning framework that rapidly generates neural network architectures for new datasets by learning from a database of datasets and pretrained models, significantly reducing search time.
Contribution
The paper presents MetaD2A, a novel meta-learning approach that generalizes neural architecture search across multiple datasets with minimal computational cost.
Findings
MetaD2A generalizes well to unseen datasets like CIFAR-10 and CIFAR-100.
MetaD2A achieves a search time of approximately 33 GPU seconds.
MetaD2A outperforms existing methods like NSGANetV2 in speed with comparable accuracy.
Abstract
Despite the success of recent Neural Architecture Search (NAS) methods on various tasks which have shown to output networks that largely outperform human-designed networks, conventional NAS methods have mostly tackled the optimization of searching for the network architecture for a single task (dataset), which does not generalize well across multiple tasks (datasets). Moreover, since such task-specific methods search for a neural architecture from scratch for every given task, they incur a large computational cost, which is problematic when the time and monetary budget are limited. In this paper, we propose an efficient NAS framework that is trained once on a database consisting of datasets and pretrained networks and can rapidly search for a neural architecture for a novel dataset. The proposed MetaD2A (Meta Dataset-to-Architecture) model can stochastically generate graphs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsDepthwise Convolution · Pointwise Convolution · ReLU6 · Depthwise Separable Convolution · Batch Normalization · Dense Connections · Sigmoid Activation · Hard Swish · 1x1 Convolution · Dropout
