Efficient Novelty-Driven Neural Architecture Search

Miao Zhang; Huiqi Li; Shirui Pan; Taoping Liu; Steven Su

arXiv:1907.09109·cs.LG·July 23, 2019·1 cites

Efficient Novelty-Driven Neural Architecture Search

Miao Zhang, Huiqi Li, Shirui Pan, Taoping Liu, Steven Su

PDF

Open Access

TL;DR

This paper introduces a novelty search-based sampling method for neural architecture search that improves supernet training predictiveness, achieves state-of-the-art results efficiently, and reduces computational costs.

Contribution

It proposes a novel architecture sampling strategy based on novelty search, replacing complex controllers, to enhance NAS efficiency and accuracy.

Findings

01

Achieved 2.51% test error on CIFAR-10 within 7.5 hours.

02

Demonstrated effectiveness on PTB, ImageNet, and WikiText-2 datasets.

03

Outperformed existing NAS methods in search efficiency and accuracy.

Abstract

One-Shot Neural architecture search (NAS) attracts broad attention recently due to its capacity to reduce the computational hours through weight sharing. However, extensive experiments on several recent works show that there is no positive correlation between the validation accuracy with inherited weights from the supernet and the test accuracy after re-training for One-Shot NAS. Different from devising a controller to find the best performing architecture with inherited weights, this paper focuses on how to sample architectures to train the supernet to make it more predictive. A single-path supernet is adopted, where only a small part of weights are optimized in each step, to reduce the memory demand greatly. Furthermore, we abandon devising complicated reward based architecture sampling controller, and sample architectures to train supernet based on novelty search. An efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory