Efficient Novelty-Driven Neural Architecture Search
Miao Zhang, Huiqi Li, Shirui Pan, Taoping Liu, Steven Su

TL;DR
This paper introduces a novelty search-based sampling method for neural architecture search that improves supernet training predictiveness, achieves state-of-the-art results efficiently, and reduces computational costs.
Contribution
It proposes a novel architecture sampling strategy based on novelty search, replacing complex controllers, to enhance NAS efficiency and accuracy.
Findings
Achieved 2.51% test error on CIFAR-10 within 7.5 hours.
Demonstrated effectiveness on PTB, ImageNet, and WikiText-2 datasets.
Outperformed existing NAS methods in search efficiency and accuracy.
Abstract
One-Shot Neural architecture search (NAS) attracts broad attention recently due to its capacity to reduce the computational hours through weight sharing. However, extensive experiments on several recent works show that there is no positive correlation between the validation accuracy with inherited weights from the supernet and the test accuracy after re-training for One-Shot NAS. Different from devising a controller to find the best performing architecture with inherited weights, this paper focuses on how to sample architectures to train the supernet to make it more predictive. A single-path supernet is adopted, where only a small part of weights are optimized in each step, to reduce the memory demand greatly. Furthermore, we abandon devising complicated reward based architecture sampling controller, and sample architectures to train supernet based on novelty search. An efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsSigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
