Hierarchical Representations for Efficient Architecture Search
Hanxiao Liu, Karen Simonyan, Oriol Vinyals, Chrisantha Fernando, Koray Kavukcuoglu

TL;DR
This paper introduces a hierarchical genetic representation for evolutionary neural architecture search. The search discovers image classifiers that outperform many manually designed models, and a random-search variant cuts search time from 36 hours to 1 hour at a small cost in accuracy.
Contribution
The paper presents a novel hierarchical genetic representation scheme, modeled on the modular design patterns used by human experts, together with an expressive search space that supports complex topologies.
Findings
Achieved 3.6% top-1 error on CIFAR-10
Obtained 20.3% top-1 error when transferred to ImageNet
Random search reduced search time from 36 hours to 1 hour, at a cost of 0.3% top-1 accuracy on CIFAR-10 and 0.1% on ImageNet
Abstract
We explore efficient neural architecture search methods and show that a simple yet powerful evolutionary algorithm can discover new architectures with excellent performance. Our approach combines a novel hierarchical genetic representation scheme that imitates the modularized design pattern commonly adopted by human experts, and an expressive search space that supports complex topologies. Our algorithm efficiently discovers architectures that outperform a large number of manually designed models for image classification, obtaining top-1 error of 3.6% on CIFAR-10 and 20.3% when transferred to ImageNet, which is competitive with the best existing neural architecture search approaches. We also present results using random search, achieving 0.3% less top-1 accuracy on CIFAR-10 and 0.1% less on ImageNet whilst reducing the search time from 36 hours down to 1 hour.
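The ideas in the abstract (a hierarchical genotype, evolutionary search, and a random-search baseline) can be illustrated with a toy sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the primitive names, the two-level hierarchy, the single-edge mutation, the tournament selection rule, and especially `toy_fitness`, which stands in for training a network and measuring validation accuracy.

```python
import random

# Hypothetical primitive operations; the names are illustrative only.
PRIMITIVES = ["conv3x3", "conv1x1", "max_pool", "identity"]
NONE = "none"  # marks the absence of an edge


def random_motif(num_nodes, choices, rng):
    """A motif is a small DAG: edges[(i, j)] with i < j names the
    lower-level motif (or primitive) applied from node i to node j."""
    return {
        (i, j): rng.choice(choices + [NONE])
        for i in range(num_nodes)
        for j in range(i + 1, num_nodes)
    }


def random_genotype(rng, num_level2=3, nodes2=3, nodes3=4):
    """Two-level hierarchy (an assumption): level-2 motifs are built from
    primitives, and the top-level motif is built from level-2 motifs."""
    level2 = {
        f"motif{k}": random_motif(nodes2, PRIMITIVES, rng)
        for k in range(num_level2)
    }
    top = random_motif(nodes3, sorted(level2), rng)
    return {"level2": level2, "top": top}


def mutate(genotype, rng):
    """Return a copy with one randomly chosen edge reassigned."""
    child = {
        "level2": {n: dict(e) for n, e in genotype["level2"].items()},
        "top": dict(genotype["top"]),
    }
    if rng.random() < 0.5:
        target, choices = child["top"], sorted(child["level2"])
    else:
        name = rng.choice(sorted(child["level2"]))
        target, choices = child["level2"][name], PRIMITIVES
    edge = rng.choice(sorted(target))
    target[edge] = rng.choice(choices + [NONE])
    return child


def flatten(genotype):
    """Expand the hierarchy into a flat list of primitive operations."""
    ops = []
    for edge in sorted(genotype["top"]):
        name = genotype["top"][edge]
        if name == NONE:
            continue
        motif = genotype["level2"][name]
        ops.extend(op for _, op in sorted(motif.items()) if op != NONE)
    return ops


def toy_fitness(genotype):
    """Placeholder fitness: counts conv3x3 ops. A real search would
    train the architecture and use its validation accuracy."""
    return flatten(genotype).count("conv3x3")


def evolve(steps=200, pop_size=20, tournament=5, seed=0):
    """Tournament selection: sample a few genotypes, mutate the fittest,
    and add the child to the (growing) population."""
    rng = random.Random(seed)
    population = [random_genotype(rng) for _ in range(pop_size)]
    for _ in range(steps):
        parent = max(rng.sample(population, tournament), key=toy_fitness)
        population.append(mutate(parent, rng))
    return max(population, key=toy_fitness)


def random_search(samples=200, seed=0):
    """Baseline: draw independent genotypes and keep the best one."""
    rng = random.Random(seed)
    candidates = (random_genotype(rng) for _ in range(samples))
    return max(candidates, key=toy_fitness)
```

Calling `evolve()` and `random_search()` with the same evaluation budget gives a rough feel for the trade-off the abstract describes, although with a placeholder fitness the comparison says nothing about real accuracy or search time.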
Videos
This Neural Network Optimizes Itself | Two Minute Papers #212 (YouTube)
Taxonomy
Topics: Evolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research · Machine Learning and Data Classification
Methods: Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
